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HUMAN CALCIUM CHANNEL COMPOSITIONS AND METHODS USING THEM 

TECHNICAL FIELD 

The present invention relates to molecular biology and 
pharmacology. More particularly, the invention relates to 
calcium channel compositions and methods of making and using 
the same. 

BACRGROraiD OF THE INVENTION 

Calcium channels are membreuie- spanning, multi-submiit 
proteins that allow controlled entry of Ca'* ions into cells 
from the extracellular fluid. Cells throughout the animal 
kingdom, and at least some bacterial, fungal and plant cells, 
possess one or more types of calcium chcumel. 

The most common type of calcium channel is voltage 
dependent. "Opening" of a voltage -dependent channel to allow 
an influx of Ca** ions into the cells requires a depolarization 
to a certain level of the potential difference between the 
inside of the cell bearing the channel and the extracellular 
medium bathing the cell. The rate of influx of Ca^* into the 
cell depends on this potential difference. All "excitable" 
cells in animals, such as neurons of the central nervous 
system (CNS) , peripheral nerve cells and muscle cells, 
including those of skeletal muscles, cardiac muscles, and 
venous and arterial smooth muscles, have voltage -dependent 
calcium channels. 

Multiple types of calcium channels have been identified 
in mammalian cells from various tissues, including skeletal 
muscle, cardiac muscle, lung, smooth muscle and brain, [see, 
e.g., Bean, B. P. (1989) Ann. Rev. PhyBiol. 51:367-384 and Hess, 
P. (1990) Ann. J^ev. Neurosci, 56^:337] . The different types of 
calcium channels have been broadly categorized into four 
classes, T-, cOid P-type, distinguished by current 

kinetics, holding potential sensitivity and sensitivity to 
calcium channel agonists and cuitagonists . 

Calcium channels are multisubtinit proteins that contain 
two large subunits, designated or^ cuid a,, which have molecular 
weights between about 130 and about 200 kilodaltons ("kD"), 
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and one to three different smaller subunits of less than about 
60 kD in molecular weight. At least one of the larger 
sxibunits and possibly some of the smaller subunits are 
glycosylated. Some of the subunits are capable of being 
phosphorylated. The subunit has a molecular weight of 
about 150 to about 170 JcD when analyzed by sodium 
dodecylsulf ate (SDS) -polyacrylamide gel electrophoresis (PAGE) 
after isolation from mammalian muscle tissue and has specific 
binding sites for various l,4~dihydropyridines (DHPs) and 
phenylalkylamines. Under non-reducing conditions (in the 
presence of N-ethylmaleimide) , the subunit migrates in 
SDS- PAGE as a band corresponding to a molecular weight of 
about 160-190 kD. Upon reduction, a laz^e fragment and 
smaller fragments are released. The P subunit of the rabbit 
skeletal muscle calcium channel is a phosphorylated protein 
that has a molecular weight of 52-65 'kD as determined by SDS- 
PAGE analysis. This siibunit is insensitive to reducing 
conditions. The y subunit of the calcium channel, which is 
not observed in all purified preparations, appears to be a 
glycoprotein with an apparent molecular weight of 30-33 kD, as 
determined by SDS-PAGB analysis. 

In order to study calcium channel structure and function, 
large amounts of pure channel protein are needed. Because of 
the complex nature of these multisubimit proteins, the varying 
concentrations of calcium channels in tissue sources of the 
protein, the presence of mixed populations of calcium channels 
in tissues, difficulties in obtaining tissues of interest, and 
the modifications of the native protein that can occur during 
the isolation procedure, it is extremely difficult to obtain 
large amoimts of highly purified, completely intact calcium 
channel protein. 

Characterization of a peorticular type of calcium channel 
by analysis of whole cells is severely restricted by the 
presence of mixed populations of different types of calcium 
cheumels in the majority of cells. Single-channel recording 
methods that are used to examine individual calcium channels 
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do not reveal any Information regarding the molecular 
structure or biochemical composition of the channel. 
Furthermore, in performing this type of analysis, the channel 
is isolated from other cellular constituents that might be 
important for natural functions and pharmacological 
interactions* 

Oiaracterization of the gene or genes encoding Calcium 
chcumels provides another means of characterization of 
different types of calcium channels. The amino acid sequence 
determined from a coiT5)lete nucleotide sequence of the coding 
region of a gene encoding a calcium channel protein represents 
the primary structiire of the protein. Furthermore, secondary 
structure of the calcium channel protein and the relationship 
of the protein to the membrane may be predicted based on 
euialysis of the primary structure. For instance t hydrqpathy 
plots of the ofi subxmit protein of the rabbit skeletal muscle 
calcium chsumel indicate that it contains four internal 
repeats, each containing six putative transmembrane regions 
[Tanabe, T. at al. (1987) Mature 325:313]. 

Because calcium channels are present in various tissues 
and have a central role in regulating intracellular calcium 
ion concentrations, they are implicated in a number of vital 
processes in animals, including neurotransmitter release, 
muscle contraction, pacemaker activity, and secretion of 
hormones and other substances. These processes appear to be 
involved in numerous human disorders, such as CNS and 
Ceurdiovascular diseases. Calcium channels, thus, are also 
implicated in numerous disorders. A number of compounds 
useful for treating various cardiovascular diseases in 
animals, including humans, are thought to exert their 
beneficial effects by modulating functions of voltage- 
dependent calcium channels present in cardiac emd/or vascular 
smooth muscle. Many of these compounds bind to calcium 
cheumels £uid block, or reduce the rate of, influx of Ca^* into 
the cells in response to depolarization of the cell membrane. 
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The results of studies of recombinant expression of 
rabbit calcium channel ot^ subunit -encoding cDNA clones and 
transcripts of the cDNA clones indicate that the subunit 
forms the pore through which calcium enters cells. The 
relevance of the barium currents generated in these 
recombinant cells to the actual current generated by calcium 
channels containing as one conqx>nent the respective 
subunits in vivo is unclear. In order to completely and 
accurately characterize and evaluate different calcium channel 
types, however, it is essential to examine the fxinctional 
properties of recombinant channels containing all of the 
siibunits as found in vivo. 

In order to conduct this examination and to fully 
understand calcium channel structure and function, it is 
critical to identify and characterize as many calcium channel 
subunits as possible. Also in order to prepare recombinant 
cells for use in identifying compounds that interact with 
calcium channels, it is necessary to be able to produce cells 
that e3q)ress uniform populations of calcium channels 
containing defined subunits. 

An understanding of the pharmacology of compounds that 
interact with calcium channels in other organ systems, such as 
the CNS, may aid in the rational design of compounds that 
specifically interact with subtypes of hxamcin calcium channels 
to have desired therapeutic effects, such as in the treatment 
of neturodegenerative and cardiovascular disorders. Such 
understanding and the ability to rationally design 
therapeutically effective compounds, however, have been 
hampered by an inability to independently determine the types 
of human calcium channels and the molecular nature of 
individual subtypes, particularly in the CNS, and by the 
unavailability of pure preparations of specific channel 
subtypes to use for evaluation of the specificity of calcium 
channel -effecting compounds. Thus, identification of DNA 
encoding human calcium channel subunits and the use of such 
DNA for expression of calcium channel subunits and functional 
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calcium channels would aid in screening and designing 
therapeutically effective compoxinds. 

Therefore, it is an object herein, to provide DNA 
encoding specific calcium channel subunits and to provide 
eukaryotic cells bearing recombinant tissue-specific or 
subtype- specific calcium channels. It is also an object to 
provide assays for identification of potentially therapeutic 
compounds that act as calcium channel antagcmists and 
agonists . 
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SUtStSARY OF THE XNVEMTION 

Isolated and purified nucleic acid fragments that encode 
human calcium channel subunits are provided. DNA encoding 
subuaiits of a humsm calcium channel, and RNA, encoding such ^ 
subunits, made upon transcription of such DNA are provided. 
In particular, DNh fragments encoding subunits of voltage- 
dependent human calcium channels (VDCCs) type A, type B (also 
referred to as VDCC IV), type C (also referred to as VDCC II) 
type D (also referred to as VDCC III) and type E are provided. 

DNA encoding of^^, a^^, a^^, oc^ and Qf^g siibunits is provided. 
DNA encoding an subunit that includes the amino acids 
substantially as set forth as residues 10-2161 of SEQ ID No. 
1 is provided. DNA encoding an subunit that includes 

STobstantially the amino acids set forth as amino acids 1-34 in 
SEQ ID No. 2 in place of amino acids 373-406 of SEQ ID No. 1 
is also provided. DNA encoding an a^^ subunit that includes 
the amino acids substantially as set forth in SEQ ID No. 3 or 
SEQ ID No. 6 and DNA encoding an a^^^ subunit that includes an 
amino acid sequence substantially as set forth in SEQ ID No. 
7 or in SEQ ID No. 8 is also provided. 

DNA encoding Of^A subunits is also provided. Such DNA 
includes DNA encoding am subunit that has substantially the 
same sequence of amino acids as encoded by the DNA set forth 
in SEQ ID No. 22 or No. 23 or other splice variants of a^^ that 
include all or part of the sequence set forth in SEQ ID No. 22 
or 23. The sequence set forth in SEQ ID NO. 22 is a splice 
variant designated ct^.^s and the sequence set forth in SEQ ID 
NO. 23 is a splice variant designated I^NA encoding 

subunits also include DNA encoding subxinits that can be *' 
isolated using all or a portion of the DNA having SEQ ID NO. 
21, 22 or 23 or DNA obtained from the phage lysate of am E. « 
coli host containing DNA encoding sm or^ sxibimit that has been 
deposited in the American Type Culture Collection, 12301 
Parklawn Drive, Rockville, Maryland 20852 U.S.A. under 
Accession No. 75293 in accord with the Budapest Treaty. The 
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DNA in such phage includes a DNA fragment having the sequence 
set forth in SEQ ID No. 21. This fragment selectively 
hybridizes under conditions of high stringency to DNA encoding 
oiui but not to DNA encoding a^^ and, thus, can be used to 
isolate DNA that encodes (x^ subunits. 

DNA encoding of^ subunits of a human calcium channel is 
also provided. This DNA includes DNA that encodes - an ot^^ 
splice variant designated a^.^ encoded by the DNA set forth in 
SEQ ID No. 24, and a variant designated 0^.3 encoded by SEQ ID 
No. 25. This DNA also includes other splice variants thereof 
that encodes sequences of amino acids encoded by all or a 
portion of the sequences of nucleotides set forth in SEQ id 
Nos. 24 and 25 and DNA that hybridizes under conditions of 
high stringency to the DNA of SEQ ID. No. 24 or 25 and that 
encodes an a^^ splice variant. 

DNA encoding of^ subunits of a human calcium channel, and 
RNA encoding such subunits, made upon transcription of such a 
DNA are provided. DNA encoding splice variants of the 
subunit, including tissue- specific splice variants, are also 
provided. In particular, DNA encoding the a^a-ffae subunit 
subtypes is provided, in particularly preferred embodiments, 
the DNA encoding the siibunit that is produced by 

alternative processing of a primary transcript that includes 
DNA encoding the amino acids set forth in SEQ ID 11 and the 
DNA of SEQ ID No. 13 inserted between nucleotides 1624 and 
1625 of SEQ ID No. 11 is provided. The DNA and amino acid 
sequences of ofj» - ofj^ are set forth in SEQ. ID Nos. 11 and 
29-32, respectively. 

Isolated and purified DNA fragments encoding human 
calcium channel subunits, including DNA encoding p^, fi^, 
and subunits, and splice variants of the ^ subunits are 
provided. RNA encoding /5 subvmits, made upon transcription . of 
the DNA is also provided. 

DNA encoding a jS^ subunit that is produced by alternative 
processing of a primary transcript that includes DNA encoding 
the amino acids set forth in SEQ ID No. 9, but including the 
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DNA set forth in SEQ ID No. 12 inserted in place of 
nucleotides 615-781 of SEQ ID No. 9 is also provided. DNA 
encoding subunits that are encoded by transcripts that have 
the sequence set forth in SEQ ID No. 9 including the DNA set 
forth in SEQ ID No. 12 inserted in place of nucleotides 615- 
781 of SEQ ID No. 9, but that lack one or more of the 
following sequences of nucleotides: nucleotides 14-34 of SEQ 
ID No. 12, nucleotides 13-34 of SEQ ID No. 12, nucleotides 35- 
55 of SEQ ID No 12, nucleotides 56-190 of SEQ ID No. 12 and 
nucleotides 191-271 of SEQ ID No. 12 are also provided. In 
particular, subimit splice variants /Si^x-Z^^.j (see, SEQ ID 
Nos. 9, 10 and 33-35) described below, are provided. 

Ba subunit splice variants P^^-P^^, that include all or a 
portion of SEQ ID Nos. 26, 29 and 30 are provided; fi^ subunit 
splice variants, including ^3 subunit splice variants that 
have the sequences set forth in SEQ ID Nos 19 and 20, and DNA 
encoding the subunit that includes DNA having the sequence 
set forth in SEQ ID No. 27 and the amino acid sequence set 
forth in SEQ ID No. 28 are provided. 

Also Escherichia coli (E. coli) host cells harboring 
plasmids containing DNA encoding have been deposited in 
accord with the Budapest Treaty under Accession No. 69048 at 
the American Type Culture Collection. The deposited clone 
encompasses nucleotides 122-457 in SEQ ID No. 19 and 107-443 
in SEQ ID No. 20. 

DNA encoding p subtmits that are produced by alternative 
processing of a primary transcript encoding a p subtmit, 
including a transcript that includes DNA encoding the amino 
acids set forth in SEQ ID No. 9 or including a primary 
transcript that encodes ^3 as deposited under ATCC Accession 
No. 69048, but lacking and including alternative exons are 
provided or may be constructed from the DNA provided herein. 

DNA encoding 7 subunits of human calcium channels is also 
provided. RNA, encoding y subimits, made upon transcription 
of the DNA are also provided. In particular, DNA containing 
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the sequence of nucleotides set forth in SEQ ID No. 14 is 
provided . 

Full-length DNA clones and corresponding RNA transcripts, 
encoding ot^, including splice variants of a^, ocj^q, a^^, a^^i and 
Ofig, and p subunits, including Pi_i-/8i_5, ^20 ^20. p2Z' and 
of humcm calcium channels are provided. Also provided are 
DNA clones encoding a substantial portions of the certain a^f. 
subtype subunits and y subunits of voltage-dependent human 
calcium channels for the preparation of full-length DNA clones 
encoding the corresponding full-length subunits. Pull-length 
clones may be readily obtained using the disclosed DNA as a 
probe as described herein. 

The the a^^ subunit, ot^c subunit, subunit euid splice 
. variants thereof, the P^d* 02c and sxabunits and fi^ subunits 
cuid nucleic acids encoding these subunits are of particular 
interest herein. 

Eukaryotic cells containing heterologous DNA encoding one 
or more calcium channel subunits, particularly human calcium 
channel subunits, or containing RNA transcripts of DNA clones 
encoding one or more of the subunits are provided. A single 
or^ subunit can form a channel. The requisite combination of 
subunits for formation of active channels in selected cells, 
however, can be determined empirically using the methods 
herein. For example, if a selected subtype or variant does 
not form an active channel in a selected cell line, an 
additional subunit or stibunits can be added until an active 
channel is formed. 

In preferred embodiments, the cells contain DNA or RNA 
encoding a human siibunit, preferably at least an a^^, a^^, a^^ 
or ttiE subunit. In more preferred embodiments, the cells 
contain DNA or RNA encoding additional heterologous subunits, 
* including at least one P, or y subunit. In such 

embodiments, eukaryotic cells stably or transiently 
transfected with euiy combination of one, two, three or four of 
the subunit -encoding DNA clones, such as DNA encoding any of 
ofi, ofi + /5, ofi jff + <X2i are provided. 
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The eukaryotic cells provided herein contain heterologous 
DNA that encodes an subunit or heterologous DNA that 
encodes an subunit and heterologous DNA that encodes a P 
subunit. At least one subunit is selected ^1^.1, ©1^-2^ ctic-i' 
«iE-i' ^2C' ^2D/ P2rt a /Sj.j, /Jj.j siabunit or a subunit. li 

preferred embodiments, the cells express such heterologous 
calcium chaxmel subunits and include one or more of the 
subunits in membrane -spanning heterologous calcium channels* 
In more preferred embodiments, the eukaryotic cells express 
functional, heterologous calcium channels that are capable of 
gating the passage of calcium channel -selective ions and/or 
binding compounds that, at physiological concentrations, 
modulate the activity of the heterologous calcium channel. In 
certain embodiments, the heterologous calcium channels include 
at least one heterologous calcium channel subunit. In most 
preferred embodiments, the calcium channels that are esqpressed 
on the surface of the eukaryotic cells are composed 
substantially or entirely of subtinits encoded by the 
heterologous DNA or RNA. In preferred embodiments, the 
heterologous calcium channels of such cells are 
distinguishable from any endogenous calcium channels of the 
host cell. Such cells provide a means to obtain homogeneous 
populations of calcium channels. Typically, the cells contain 
the selected calcium channel as the only heterologous ion 
channel expressed. by the cell. 

In certain embodiments the recombinant eukaryotic cells 
that contain the heterologous DNA encoding the calcium chsmnel 
subunits are produced by transf ection with DNA encoding one or 
more of the subunits or are injected with RNA transcripts of 
DNA encoding one or more of the calcium channel subunits. The 
DNA may be introduced as a linear DNA fragment or may be 
included in an expression vector for stable or transient 
expression of the subimit-encoding DNA. Vectors containing 
DNA encoding human calcium channel subimits are also provided. 
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The euXaryotic cells that express heterologous calcium 
channels may be used in assays for calcium channel function 
or, in the case of cells transformed with fewer subunit- 
encoding nucleic acids than necessary to constitute a 
functional recombinant human calcium channel, such cells may 
be used to assess the effects of additional subunits on 
calcium channel activity. The additional subunits can be 
provided by siibsequently transfecting such a cell with one or 
more DNA clones or RNA transcripts encoding human calcium 
channel subunits. 

The recombinant eukaryotic cells that express membrane 
spanning heterologous calcium channels may be used in methods 
for identifying compounds that modulate calcium channel 
activity. In particular, the cells are used in assays that 
identify agonists and antagonists of calcium channel activity 
in humans and/or assessing the contribution of the various 
calcium channel subunits to the transport and regulation of 
transport of calcium ions* Because the cells constitute 
homogeneous populations of calcium channels, they provide a 
means to identify agonists or antagonists of calcium channel 
activity that are specific for each such population. 

The assays that use the eukaryotic cells for identifying 
compounds that modulate calcium channel activity are also 
provided. In practicing these assays the eukaryotic cell that 
expresses a heterologous calcium channel, containing at least 
on subunit encoded by the DNA provided herein, is in a 
solution containing a test compound and a calcium channel 
selective ion, the cell membrane is depolarized, and current 
flowing into the cell is detected. If the test compoimd is 
one that modulates calcium channel activity, the current that 
is detected is different from that produced by depolarizing 
the same or a substantially identical cell in the presence of 
the same calcium channel -selective ion but in the absence of 
the compound. in preferred embodiments, prior to the 
depolarization step, the cell is maintained at a holding 
potential which substantially inactivates calcium channels 



-12- 



which are endogenous to the cell. Also in preferred 
embodiments, the cells are mammalian cells, most prefercQ>ly 
HEK cells, or anrphibian oocytes. 

Nucleic acid probes, typically labeled for detection, 
containing at least about 14, preferably 16, or, if desired, 
20 or 30 or more, contiguous nucleotides of a^, a^^, ot^, 
and ffa* 0' including /S^, »Py and /S^ splice variants and 
7 subunit- encoding MIA are provided. Methods using the probes 
for the isolation and cloning of calcium channel subunit- 
encoding DNA, including splice variants within tissues and 
inter- tissue variants are also provided- 

Purified humem calcium channel subiinits and purified 
human calcium channels are provided. The subunits and 
channels can be isolated from a eukaryotic cell transfected 
with DNA that encodes the subunit. 

In auiother embodiment, immunoglobulins or antibodies 
obtained from the serum of an animal immunized with a 
substantially pure preparation of a human calcium channel, 
human calcium channel subunit or epitope- containing fragment 
of a human calcium subunit are provided. Monoclonal 
antibodies produced using a human calcium channel, htiman 
calcium cha nn el subimit or epitope -containing fragment thereof 
as an immunogen are also provided. coli fusion proteins 

including a fragment of a humsui calcium channel subunit may 
also be used as immunogen. Such fusion proteins may contain 
a bacterial protein or portion thereof, such as the E. coli 
TrpE protein, fused to a calcium channel subunit peptide. The 
immunoglobulins that are produced using the calcium chamnel 
subunits or purified calciiim channels as immunogens have, 
among other properties, the ability to specifically and 
preferentially bind to and/or cause the immunoprecipitation of 
a human calcium channel or a subunit thereof which may be 
present in a biological sample or a solution derived from such 
a biological saii9>le. Such antibodies may also be used to 
selectively isolate cells that express calcium channels that 
contain the subunit for which the antibodies are specific. 
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Methods for modulating the activity of ion channels by 
contacting the calcium channels with an effective amount of 
the above-described antibodies are also provided. 

A diagnostic method for determining the presence of 
Lambert Eaton Syndrome (LBS) in a human based on immunological 
reactivity of LES immunoglobulin G (igG) with a human calcium 
channel subunit or a eukaryotic cell which expresses a 
recombinant human calcium channel or a subunit thereof is also 
provided. In particular, an immunoassay method for diagnosing 
Lambert-Eaton Syndrome in a person by combining serum or an 
IgG fraction from the person (test serum) with calcium channel 
proteins, including the a and /8 subunits, and ascertaining 
whether antibodies in the test serum react with one or more of 
the subunits. or a recombinant cell which expresses one or 
more of the subunits to a greater extent than antibodies in 
control serum, obtained from a person or group of persons 
known to be free of the Syndrome, is provided. Any 
immunoassay procedure known in the art for detecting 
antibodies against a given antigen in serum can be employed in 
the method. 

DETAILED DBSCRlPnOir OP THE mVEHTlOH 
Definitions : 

Unless defined otherwise, all technical and scientific 
terms used herein have the same meaning as is commonly 
understood by one of skill in the art to which this invention 
belongs. All patents and publications referred to herein are 
incorporated by reference herein. 

Reference to each of the calcium channel subunits 
includes the subunits that are specifically disclosed herein 
and human calcium channel subunits encoded by DNA that can be 
isolated by using the DNA disclosed as probes and screening an 
appropriate human cDHA or genomic library under at least low 
stringency. Such DNA also includes DNA that encodes proteins 
that have about 40% homology to any of the subunits proteins 
described herein or DNA that hybridizes under conditions of at 
least low stringency to the DNA provided herein and the 



-14- 



protein encoded by such DNA exhibits additional identifying 
characteristics, such as function or molecular weight. 

It is understood that subunits that are encoded by 
transcripts that represent splice variants of the disclosed 
subunits or other such subunits may exhibit less than 40% 
overall homology to any single subunit, but will include 
regions of such homology to one or more such subunits. It is 
also understood that 40% homology refers to proteins that 
share approximately 40% of their amino acids in common or that 
share somewhat less, but include conservative amino acid 
substitutions, whereby the activity of the protein is not 
substantially altered. 

As used herein, the or^ subunits types, encoded by 
different genes, are designated as type otjj^, a^^, a^c> «id 
ofiE- These types have also been referred to as VDCC IV for a„, 
VDCC II for ofic and VDCC III for a^^- Subunit subtypes, which 
are splice variants, are referred to, for example as a^.^, a^g. 
2 , o^ic-i etc • 

Thus, as used herein, DNA. encoding the subunit refers 
to DNA that hybridizes to the DNA provided herein under 
conditions of at least low stringency or encodes a subunit 
that has at least about 40% homology to protein encoded by DNA 
disclosed herein that encodes an subunit of a human 
calcium. An ofi subunit may be identified by its ability to 
form a calcium channel. Typically, siibxmits have 

molecular masses greater than at least about 120 kD. Also, 
hydropathy plots of deduced subunit amino acid sequences 
indicate that the subunits contain four internal repeats, 
each containing six putative transmembrane domains. 

The activity of a calcium channel may be assessed in 
vitro by methods known to those of skill in the art, including 
the electrophysiological etnd other methods described herein. 
Typically, subunits include regions to which one or more 
modulators of calcium channel activity, such as a 1,4-DHP or 
w-CgTx, interact directly or indirectly. Types of subunits 
may be distinguished by any method known to those of skill in 
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the art. including on the basis of binding specificity. For 
example, it has been found herein that a« subunits participate 
in the formation channels that have previously been referred 
to as N-type channels, a„ subunits participate in the 
formation of channels that had previously been referred to as 
L-type channels, and subunits appear to participate in the 
formation of channels that exhibit characteristics typical of 
channels that had previously been designated P-type channels . 
Thus, for example, the activity of channels that contain the 
Of« subunit are insensitive to 1,4-DHPs; whereas the activity 
of channels that contain the a„ subunit are modulated or 
altered by a 1.4-DHP, it is presently preferable to refer to 
calcium channels based on pharmacological characteristics and 
current kinetics and to avoid historical designatiems . Types 
and subtypes of a, subunits may be characterized on the basis 
of the effects of such modulators on the subunit or a channel 
containing the subunit as well as differences in currents and 
current kinetics produced by calcium channels containing the 
subunit . 

As used herein, an ofj subunit is encoded by DNA that 
hybridizes to the DNA provided herein under conditions of low 
stringency or encodes a protein that has at least about 40% 
homology with that disclosed herein. Such DNA encodes a 
protein that typically has a molecular mass greater than about 
120 kD, but does not form a calcium channel in the absence of 
an or, subunit. and may alter the activity of a calcium channel 
that contains an ot, subunit. Subtypes of the a, subunit that 
arise as splice variants are designated by lower case letter, 
such as ttj,, . . . ofj^. In addition, the a, subunit and the 
large fragment produced when the protein is subjected to 
reducing conditions appear to be glycosylated with at least 
N-linked sugars and do not specifically bind to the 1,4-DHPs 
and phenylalkylamines that specifically bind to the cr, 
subunit. The smaller fragment, the C- terminal fragment, is 
referred to as the 6 subunit and includes amino acids from 
about 946 (SEQ ID No. 11) through about the C-terminus. This 
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f ragment may dissociate from the remaining portion of when 
the 0^2 svibunit is exposed to reducing conditions. 

As used herein, a /9 subunit is encoded by DNA that 
hybridizes to the DNA provided herein under conditions of low 
stringency or encodes a protein that has at least about 40% 
homology with that disclosed herein and is a protein that 
typically has a molecular mass lower than the a subunit s and 
on the order of about 50-80 kD, does not form a detectable 
calcium channel in the absence of an oi^ subxmit, but may alter 
the activity of a calcium channel that contains an subunit 
or that contains an and ofa subunit. 

Types of the subxmit that sure encoded by different 
genes are designated with subscripts, such as P^, p^, and 0^. 
Subtypes of P subtmits that arise as splice variants of a 
particular type are designated with a numerical subscript 
referring to the type and to the variant. Such subtypes 
include, but are not limited to the P^ splice variants, 
including p^.^-Pi.^ and p^ variants, including p2c''P2E' 

As used herein, a y subunit is a subunit encoded by DNA 
disclosed herein as encoding the y subunit and may be isolated 
and identified using the DNA disclosed herein as a probe by 
hybridization or other such method known to those of skill in 
the art, whereby full-length clones encoding a y siibunit may 
be isolated or constructed. A 7 subimit will be encoded by 
DNA that hybridizes to the DNA provided herein imder 
conditions of low stringency or exhibits sufficient sequence 
homology to encode a protein that has at least about 40% 
homology with the y subunit described herein. 

Thus, one of skill in the art, in light of the disclosure 
herein, can identify DNA encoding a^, ofj, P, 6 and 7 calcium 
chamnel subunits, including types encoded by different genes 
and subtypes that represent splice variants. For example, 
DNA probes based on the DNA disclosed herein may be used to 
screen an appropriate library, including a genomic or cDNA 
library, for hybridization to the probe and obtain DNA in one 
or more clones that includes an open reading fragment that 
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encodes an entire protein. Subsequent to screening an 
appropriate library with the DNA disclosed herein, the 
isolated DNA can be examined for the presence of an open 
reading fraioe from which the sequence of the encoded protein 
may be deduced. Determination of the molecular weight and 
comparison with the sequences herein should reveal the 
identity of the sxibunit as am a^, aj etc. subiinit. Functional 
assays may, if necessary, be used to determine whether the 
subunit is an a^, ol^ subunit or P subunit. 

For exainple, DNA encoding an ot^ subunit may be isolated 
by screening an appropriate library with DNA, encoding all or 
a portion of the human ot^ subunit. Such DNA includes the DNA 
in the phage deposited under ATCC Accession No. 75293 that 
encodes a portion of an subunit. DNA encoding an ot^ 
subimit may obtained from an appropriate library by screening 
with an oligonucleotide having all or a portion of the 
sequence set forth in SEQ ID No. 21, 22 and/or 23 or with the 
DNA in the deposited phage. Alternatively, such DNA may have 
a sequence that encodes an ol^ siabunit that is encoded by SEQ 
ID NO. 22 or 23. 

Similarly, DNA encoding ft may be isolated by screening 
a human cDNA library with DNA probes prepared from the plasmid 
/51.42 deposited under ATCC Accession No. 69048 or obtained 
from an appropriate library using probes having sequences 
prepared according to the sequences set forth in SEQ ID Nos. 
19 and/or 20. Also, DNA encoding fi^ may be isolated by 
screening a human cDNA library with DNA probes prepared 
according to DNA set forth in SEQ ID No. 27, which sets forth 
the DNA sequence of a clone encoding a subunit. The amino 
acid sequence is set forth in SEQ ID No. 28. Any method known 
to those of skill in the art for isolation cuid identification 
of DNA and preparation of full-length genomic or cDNA clones, 
including methods exeitplif led herein, may be used. DNA 
encoding 

The si±>unit encoded by isolated DNA may be identified by 
comparison with the DNA and amino acid sequences of the 
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subunits provided herein. Splice variants share extensive 
regions of homology, but include non-homologous regions, 
subunits encoded by different genes share a itnifoim 
distribution of non- homologous sequences. 

As used herein, a splice variant refers to a variant 
produced by differential processing of a primary transcript of 
genomic DNA that results in more than one type of mRlD^. 
Splice variants may occult within a single tissue type or among 
tissues (tissue-specific variants) • Thus, cDNA clones that 
encode calcium channel subunit subtypes that have regions of 
identical amino acids and regions of different amino acid 
sequences are referred to herein as "splice variants". 

As used herein, a "calcium channel -selective ion" is an 
ion that is capable of flowing through, or being blocked from 
flowing through, a calcium channel which spans a cellular 
membrane under conditions which would substantially similarly 
permit or block the flow of Ca^"^, Ba^* is an example of an ion 
which is a calcium channel -selective ion. 

As used herein, a compound that modulates calcium channel 
activity is one that affects the ability of the calcium 
channel to pass calcium channel -selective ions or affects 
other detectable calcium channel features, such as current 
kinetics. Such compounds include calcium channel antagonists 
cuid agonists and compounds that exert their effect on the 
activity of the calcium channel directly or indirectly. 

As used herein, a "substantially pure" subunit or protein 
is a subunit or protein that is sufficiently free of other 
polypeptide contaminants to appear homogeneous by SDS-PA6E or 
to be umambiguously sequenced. 

As used herein, selectively hybridize means that a DNA 
fragment hybridizes to a second fragment with sufficient 
specificity to permit the second fragment to be identified or 
isolated from among a plurality of fragments. In general, 
selective hybridization occurs at conditions of high 
stringency. 
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As used herein, heterologous or foreign DNA and RNA are 
used interchangeably and refer to DNA or RNA that does not 
occur naturally as part of the genome in which it is present 
or which is found in a location or locations in the genome 
that differ from that in which it occurs in nature. It is DNA 
or RNA that is not endogenous to the cell and has been 
artificially introduced into the cell. Examples of 
heterologous DNA include, but are not limited to, DNA that 
encodes a calcium channel subunit and DNA that encodes RNA or 
proteins that mediate or alter expression of endogenous DNA by 
affecting transcription, translation, or other regulatable 
biochemical processes. The cell that expresses the 
heterologous DNA, such as DNA encoding a calcium channel 
subunit, may contain DNA encoding the same or different 
calcium channel subunits. The heterologous DNA need not be 
expressed and may be introduced in a meumer such that it is 
integrated into the host cell genome or is maintained 
episomally. 

As used herein, operative linkage of heterologous DNA to 
regulatory smd effector sequences of nucleotides, such as 
promoters, enhancers, transcriptional and translational stop 
sites, and other signal sequences, refers to the functional 
relationship between such DNA and such sequences of 
nucleotides. For example, operative linkage of heterologous 
DNA to a promoter refers to the physical and ftinctional 
relationship between the DNA and the promoter such that the 
transcription of such DNA is initiated from the promoter by an 
RNA polymerase that specifically recognizes, binds to and 
transcribes the DNA in reading frame. 

As used her-ein, isolated, substantially pure DNA refers 
to DNA fragments purified according to standard techniques 
employed by those skilled in the art [see, e.g., Maniatis et 
al. (1982) Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY] . 

As used herein, expression refers to the process by which 
nucleic acid is transcribed into mRNA and translated into 
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peptides, polypeptides, or proteins. If the nucleic acid is 
derived from genomic DNA, expression may, if an appropriate 
eukaryotic host cell or orgamism is selected, include splicing 
of the mRNA. 

As used herein, vector or plasmid refers to discrete 
elements that are used to introduce heterologous DMA into 
cells for either expression of the heterologous DNA or for 
replication of the cloned heterologous DNA. Selection and use 
of such vectors and plasmids are well within the level of 
skill of the art. 

As used herein, expression vector includes vectors 
capable of expressing DNA fragments that are in operative 
linkage with regulatory sequences, such as promoter regions, 
that are capcible of effecting expression of such DNA 
fragments. Thus, an expression vector refers to a recombinant 
DNA or RNA construct, such as a plasmid, a phage, recombinant 
virus or other vector that, upon introduction into an 
appropriate host cell, results in expression of the cloned 
DNA. Appropriate expression vectors are well known to those 
of skill in the art euid include those that are replicable in 
eukaryotic cells and/or prokaryotic cells and those that 
remain episomal or may integrate into the host cell genome. 

As used herein, a promoter region refers to the portion 
of DNA of a gene that controls transcription of DNA to which 
it is operatively linked. The promoter region includes 
specific sequences of DNA that are sufficient for RNA' 
polymerase recognition, binding euid transcription initiation. 
This portion of the promoter region is referred to as the 
promoter. In addition, the promoter region includes sequences 
that modulate this recognition, binding and transcription 
initiation activity of the RNA polymerase. These sequences 
may be cis acting or may be responsive to trans acting 
factors. Promoters, depending upon the nature of the 
regulation, may be constitutive or regulated. 

As used herein, a recombinant eulcaryotic cell is a 
eukaryotic cell that contains heterologous DNA or RNA. 
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As used herein, a recombinant or heterologous calcium 
channel refers to a calcium channel that contains one or more 
subunits that are encoded by heterologous DNA that has been 
introduced into and expressed in a eukaryotic cells that 
expresses the recombinant calcium channel. A recombinant 
calcium channel may also include subunits that are produced by 
DNA endogenous to the cell. in certain embodiments, the 
recombinant or heterologous calcium channel may contain only 
subunits that are encoded by heterologous DNA. 

As used herein, "functional" with respect to a 
recombinant or heterologous calcium channel means that the 
channel is able to provide for and regulate entry of calcium 
channel -selective ions, including, but not limited to, Ca^* or 
Ba^*, in response to a stimulus and/ or bind ligands with 
affinity for the channel. Preferably such calcium channel 
activity is distinguishable, such as electrophysiological, 
pharmacological and other means known to those of skill in the 
art, from any endogenous calcium channel activity that in the 
host cell. 

As used herein, a peptide having an amino acid sequence 
substantially as set forth in a particular SEQ ID No. 
includes peptides that have the same function but may include 
minor variations in sequence, such as conservative amino acid 
changes or minor deletions or insertions that do not alter the 
activity of the peptide. The activity of a calcium channel 
receptor subunit peptide refers to its ability to form 
functional calcium channels with other such subunits. 

As used herein, a physiological concentration of a 
compoiind is that which is necessary and sufficient for a 
biological process to occur. For example, a physiological 
concentration of a calcium channel -selective ion is a 
concentration of the calcium channel -selective ion necessary 
and sufficient to provide an inward current when the channels 
open. 

As used herein, activity of a calcium channel refers to 
the movement of a calcium channel-selective ion through a 
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calcium channel . Such activity may be measured by any method 
known to those of skill in the art, including, but not limited 
to, measurement of the amount of current which flows through 
the recombinant channel in response to a stimulus. 

As used herein, a "functional assay" refers to an assay 
that identifies functional calcium channels , A functional 
assay, thus, is an assay to assess function. 

As understood by those skilled in the art, assay methods 
for identifying coii5)ounds, such as antagonists and agonists, 
that modulate calcium channel activity, generally requires 
comparison to a control. One type of a "control" cell or 
"control" culture is a cell or culture that is treated 
substantially the same as the cell or culture e^qposed to the 
test con^und except that the control culture is not exposed 
to the test cott?)ound. Another type of a "control" cell or 
"control" culture may be a cell or a culture of cells which 
are identical to the transfected cells except the cells 
employed for the control culture do not express functional 
calcium channels. In this situation, the response of test 
cell to the test compound is compared to the response (or lack 
of respcmse) of the calcium channel -negative cell to the test 
compound, when cells or cultures of each type of cell are 
e3<posed to substantially the same reaction conditions in the 
presence of the compound being assayed. For exanple, in 
methods that use patch clamp electrophysiological procedures, 
the same cell can be tested in the presence and absence of the 
test compound, by changing the exteraal solution bathing the 
cell as known in the art- 

It is also understood that each of the subunits disclosed 
herein may be modified by making conservative amino acid 
substitutions and the resulting modified subunits are 
contemplated herein. Suitable conservative substitutions of 
amino acids are known to those of skill in this art and may be 
made generally without altering the biological activity of the 
resulting molecule. Those of skill in this art recognize 
that, in general, single amino acid substitutions in non-^ 
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essential regions of a polypeptide do not substantially alter 
biological activity (see, e.g. , Watson et al^ Molecular 
Biology of the Gene, 4th Edition, 1987, The Bejacmin/Cummings 
Pub- CO., p.224) . Such substitutions are preferably, although 
not exclusively, made in accordance with those set forth in 
TABLE 1 as follows: 

TABLE 1 



Original residije 


Conservative substitution 


Ala (A) 


Gly; Ser 


Arg (R) 


Lys 


Asn (N) 


Gin; His 


Cys IC) 


Ser 


Gin (Q) 


Asn 


Glu (E) 


Asp 


Gly (G) 


Ala; Pro 


His <H) 


Asn; Gin 


lie (1) 


Uu; Val 


Leu (L) 


He; Val 


Lys (K) 


Arg; Gin; Glu 


Met (M) 


Leu; Tyr; lie 


Phe (F) 


Met; Leu; Tyr 


Ser (8) 


Thr 


Thr (T) 


Ser 


Trp(W) 


Tyr 


Tyr (V) 


Trp; Phe 


Val (V) 


lie; Leu 


Other subs t i tut ions 


are also permissible a 



determined empirically or in accord with known conservative 
substitutions- Any such modification of the polypeptide may 
be effected by any means known to those of skill in this art. 
Mutation may be effected by any method known to those of skill 
in the art, including site-specific or site-directed 
mutagenesis of DNA encoding the protein and the use of DNA 
amplification methods using primers to introduce and amplify 
alterations in the DNA template. 

Idantlflcatlon and Isolation of DMA encoding human calcium 
channel subunits 

Methods for identifying and isolating DNA encoding a^, ot^, 
0 and y subunits of htanan calcium channels are provided. 

Identification and isolation of such DNA may be 
accomplished by hybridizing, under appropriate conditions, at 
least low stringency whereby DNA that encodes the desired 
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siabunit is isolated, restriction enzyme -digested human DMA 
with a labeled probe having at least 14 , preferably 16 or more 
nucleotides and derived from any contiguous portion of DNA 
having a sequence of nucleotides set forth herein by sequence 
identification number- Once a hybridizing fragment is 
identified in the hybridization reaction, it can be cloned 
employing standard cloning techniques known to those of skill 
in the art. Full-length clones may be identified by the 
presence of a complete open reading frame and the identity of 
the encoded protein verified by sequence comparison with the 
subunits provided herein and by functional assays to assess 
calcium channel- forming ability or other function. This 
method can be used to identify genomic DNA encoding the 
sxibunit or cDNA encoding splice variants of human calcium 
channel subunits generated by alternative splicing of the 
primary transcript of genomic subunit DNA. For instance, DNA, 
cDNA or genomic DNA, encoding a calcium channel subunit may be 
identified by hybridization to a DNA probe and characterized 
by methods known to those of skill in the art, such as 
restriction mapping and DNA sequencing, and compcured to the 
DNA provided herein in order to identify heterogeneity or 
divergence in the sequences of the DNA, Such sequence 
differences may indicate that the transcripts from which the 
cDNA was produced result from alternative splicing of a 
primary transcript, if the non-homologous and homologous 
regions are clustered, or from a different gene if the non- 
homologous regions are distributed throughout the cloned DNA. 

Any suitable method for isolating genes using the DNA 
provided herein may be used. For example, oligonucleotides 
corresponding to regions of sequence differences have been 
used to isolate, by hybridization, DNA encoding the full- 
length splice variant and can be used to isolate genomic 
clones. A probe, based on a nucleotide sequence disclosed 
herein, which encodes at least a portion of a siibunit of a 
human calcium channel, such as a tissue-specific exon, may be 
used as a probe to clone related DNA, to clone a full-length 
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cDNA clone or genomic clone encoding the human calcium channel 
subunit . 

Labeled, including, but not limited to, radioactively or 
enzymatically labeled, RNA or single-stranded DNA of at least 
14 substantially contiguous bases, preferably is or more, 
generally at least 30 contiguous bases of a nucleic acid which 
encodes at least a portion of a human calcium channel subunit, 
the sequence of which nucleic acid corresponds to a segment of 
a nucleic acid sequence disclosed herein by reference to a SEQ 
ID No. are provided. Such nucleic acid segments may be used 
as probes in the methods provided herein for cloning DNA 
encoding calcium chaimel subunits. See, generally, Sambrook 
et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd 
Edition, Cold Spring Harbor Laboratory Press. 

In addition, nucleic acid amplification techniques, which 
are well 3cnown in the art, can be used to locate splice 
variants of calcium channel subunits by employing 
oligonucleotides based on DNA sequences surrounding the 
divergent sequence primers for amplifying htunan RNA or genomic 
DNA. Size and sequence determinations of the amplification 
products can reveal splice variants. Furthermore, isolation 
of human genomic DNA sequences by hybridization can yield DNA 
containing multiple exons, separated by introns, that 
correspond to different splice variants of transcripts 
encoding human calcium channel siibunits. 

DNA encoding types and subtypes of each of the a^, ot^, p 
and y subunit of voltage-dependent human calcium channels has 
been cloned herein by nucleic acid amplication of cDNA from 
selected tissues or by screening humam cDNA libraries prepared 
from isolated poly A+ mRNA from cell lines or tissue of human 
origin having such calcium channels. Among the sources of 
such cells or tissue for obtaining mRNA are human brain tissue 
or a hiiraan cell line of neural origin, such as a neuroblastoma 
cell line, human skeletal muscle or smooth muscle cells, and 
the like. Methods of preparing cDNA libraries are well known 
in the art [see generally Ausubel et al. (1987) Current 
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Protocols in Molecular Biology, Wiley- Interscience, New York; 
and Davis et al. (1986) Basic Methods in Molecular Biology, 
Elsevier Science Publishing Co., New York] . 

Preferred regions from which to construct probes include 
5' and/or 3' coding sequences, seqpiences predicted to encode 
transmembrane domains, sequences predicted to encode 
cytoplasmic loops, signal sequences, ligand-binding sites, and 
other functionally significant sequences (see Table, below) . 
Either the full-length subunit -encoding DMA or fragments 
thereof can be used as probes, preferably labeled with 
suitable label means for ready detection. When fragments are 
used as probes, preferably the DMA sequences will be typically 
from the carboxyl-end-encoding portion of the DMA, and most 
preferably will include predicted transmembrane domain- 
encoding portions based on hydropathy analysis of the deduced 
amino acid sequence [see, e.g. , Kyte and Doolittle [(1982) »7. 
Mol. Biol. 1^7:105] • 

Riboprobes that specific for human calcium channel 
subunit types or subtypes have been prepared. These probes 
are useful for identifying expression of particular subunits 
in selected tissues and cells. The regions from which the 
probes were prepared were identified by con^jaring the DMA and 
amino acid sequences of all known a or ^ subunit subtypes. 
Regions of least homology, preferably human-derived sequences, 
and generally about 250 to about 600 nucleotides were 
selected. Numerous riboprobes for a and /8 subunits have been 
prepared; some of these are listed in the following Table. 
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TABLB 2 
SUmMLY OF RK21 PROBES 



SUBDNIT 
SPECIFICITY 



alA generic 



NUCLEOTIDE 
POSITION 

3357-3B40 



761-790 



3440-3464 



3542-3565 



PROBE NAME 



pGEM7ZQflA* 



SE700 



SB71B 



SE724 



PROBE TYPE 



ribqprobe 



oligo 



3ligo 



oligo 



ORIENTA- 
TION 



n/a 



antisense 



antisense 



sense 



oflB generic 



3091-3463 



pGEM72ttlB^ 



riboprobe 



n/a 



6635-6858 



pGEM7ZQflBa 



riboprobe 



n/a 



crlB-l 
specific 



6490-6676 



pCRII 
OflB-1/187 



riboprobe 



a/a 



aXB generic 



3114-3462 



pGEM7ZQflE 



riboprobe 



n/a 



a2b 



1321-1603 



pCRIIOf2b 



riboprobe 



n/a 



p generic (?) 



212-236 



SE300 



oligo 



antisense 



fil generic 



1267-1291 



SE301 



oligo 



antisense 



pi-2 
specific 



1333-1362 



SE17 



oligo 



antisense 



1682-1706 



SE23 



oligo 



sense 



2742-2766 



SE43 



oligo 



antisense 



27-56 



SE208 



oligo 



antisense 



340-364 



SE274 



oligo 



antisense 



340-364 



SE275 



oligo 



sense 



g3 specific 



1309-1509 



riboprobe 



n/a 



P4 specific 



1228-1560 



riboprobe 



n/a 



The pGEM series are available from Promega, Madison WI; see also, 
U.S. Patent No. 4,766,072. 

The above -noted nucleotide regions are also useful in 
selecting regions of the protein for preparation of subunit- 
specific antibodies, discussed below. 

The DNA clones and fragments thereof provided herein thus 
can be used to isolate genomic clones encoding each subunit 
and to isolate any splice variants by hybridization screening 
of libraries prepared from different human tissues. Nucleic 
acid amplification techniques, which are well known in the 
art, can also be used to locate DNA encoding splice variants 
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of human calcium channel subunits. This is accomplished by 
employing oligonucleotides based on DNA sequences surrounding 
divergent sequence (s) as primers for amplifying human RNA or 
genomic DMA. size and sequence determinations of the 
amplification products can reveal the existence of splice 
variants. Furthermore, isolation of human genomic DNA 
sequences by hybridization can yield DNA containing multiple 
exons, separated by introns, that correspond to different 
splice variants of transcripts encoding human calcium channel 
subunits . 

Once DNA encoding a calcium channel subunit is isolated, 
ribonuclease (RNase) protection assays can be employed to 
determine which tissues e3q>ress mRNA encoding a particular 
calcium channel subunit or variant. These assays provide a 
sensitive means for detecting and quantitating an RNA species 
in a complex mixture of total cellular RNA. The subunit DNA 
is labeled and hybridized with cellular RNA. if complementary 
mRNA is present in the cellular RNA, a DNA-RNA hybrid results. 
The RNA sample is then treated with RNase, which degrades 
single-stranded RNA. Any RNA-DNA hybrids are protected from 
RNase degradation and can be visualized by gel electrophoresis 
and autoradiography. in situ hybridization techniques can 
also be used to determine which tissues ejqsress mRNA encoding 
a particular calcium channel subunit. The labeled subunit 
DNAs are hybridized to different tissue slices to visualize 
subiinit mRNA escpression. 

With respect to each of the respective subunits (erj, 
^ or 7) of human calcium channels, once the DNA encoding the 
channel subunit was identified by a nucleic acid screening 
method, the isolated clone was used for further screening to 
identify overlapping clones. Some of the cloned DNA fragments 
can and have been subcloned into an appropriate vector such as 
PIBI24/25 (IBI, New Haven, CT) , M13mpl8/l9, pGEM4, pGEM3, 
PGEM7Z, pSP72 and other such vectors known to those of skill 
in this art, and characterized by DNA sequencing and 
restriction enzyme mapping. a sequential series of 
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overlapping clones may thus be generated for each of the 
subunits until a full-length clone can be prepared by methods, 
Jaiown to those of skill in the art, that include 
identification of translation initiation (start) and 
translation termination (stop) codons. For ea^ression of the 
cloned DNA, the 5' noncoding region and other transcriptional 
and translational control regions of such a clone may be 
replaced with an efficient ribosome binding site and other 
regulatory regions as known in the art. Other modifications 
of the 5' end, known to those of skill in the art, that may be 
required to optimize translation and/or transcription 
efficiency may also be effected, if deemed necessary. 

Examples II-VIIII, below, describe in detail the cloning 
of each of the various subunits of a human calcium channel as 
well as subtypes and splice variants, including tissue- 
specific variants thereof. In the few instances in which 
partial sequences of a subunit are disclosed, it is well 
within the skill of the art, in view of the teaching herein, 
to obtain the corresponding full-length clones and sequence 
thereof encoding the subunit, subtype or splice variant 
thereof using the methods described above and exemplified 
below. 

Identification and isolation of DNA encodino a, 
subunits ^ 

A number of voltage -dependent calcium channel subunit 
genes, which are expressed in the human CNS and in other 
tissues, have been identified and have been designated as a^, 
Of„ (or VDCC IV), a,c (or VDCC II}, a^^ (or VDCC III) and a^. 
DNA, isolated from a human neural cDNA library, that encodes 
each of the subunit types has been isolated. DNA encoding 
subtypes of each of the types, which arise as splice variants 
are also provided. Subtypes are herein designated, for 
example, as aie-i. aiB-2- 

The ai subunits types A B, C, D and E of voltage - 
dependent calcium channels, and subtypes thereof, differ with 
respect to sensitivity to known classes of calcium channel 
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agonists and antagonists, such as DHPs, phenylalJcYlamines, 
omega conotoxin (a)-CgTx) , the funnel web spider toxin 6)-Aga- 
IV, and pyrazonoylguanidines . These subimit types also appear 
to differ in the holding potential and in the kinetics of 
currents produced upon depolarization of cell membranes 
containing calcium channels that include different types of 
subunits . 

DNA that encodes an subimit that binds to at least one 

compound selected from among dihydropyridines, 

phenylalkylamines , w-CgTX, components of funnel web spider 

toxin, and pyrazonoylguanidines is provided. For example, the 

ofiB subimit provided herein appears to specifically interact 

with 6)-CgTx in N-type channels, and the subunit provided 

herein specifically interacts with DHPs in L-type channels. 

Identification and isolation of DNA 
encoding the human calcium channel 
subunit 

The subunit cDNA has been isolated using fragments, of 
the rcd>bit skeletal muscle calcium channel subimit cDNA as 
a probe to screen a cDNA library of a human neuroblastoma cell 
line, IMR32, to obtain clone al.36. This clone was used as a 
probe to screen additional IMR32 cell cDNA libraries to obtain 
overlapping clones, which were then en5)loyed for screening 
until a sufficient series of clones to spsui the length of the 
nucleotide sequence encoding the human subxmit was 

obtained. Full-length clones encoding were constructed by 
ligating portions of partial cLy^ clones as described in Example 
II. SEQ ID No. 1 shows the 7,635 nucleotide sequence of the 
cDNA encoding the subunit. There is a 6,483 nucleotide 
sequence reading frame which encodes a sequence of 2,161 amino 
acids (as set forth in SEQ ID No. 1). 

SEQ ID No. 2 provides the sequence of an alternative exon 
encoding the IS6 transmembrane domain [see Tanabe, T,, et al. 
(19B7) Nature 325:313-318 for a description of transmembrane 
domain terminology] of the ol^^ subimit. 

SEQ ID No. 1 also shows the 2,161 amino acid sequence 
deduced from the human neuronal calcium channel ql^ subunit 
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DNA. Based on the amino acid sequence, the protein has a 

calculated Mr of 245,163. The oi^j, subunit of the calcium 

channel contains four putative internal repeated sequence 

regions. Pour internally repeated regions represent 24 

putative transmembrane segments, and the amino- and 

carboxyl -termini extend intracellular ly» 

The of^D subunit has been shown to mediate DHP-sensitive, 

high-voltage-activated, long-lasting calcium channel activity. 

This calcium channel activity was detected when oocytes were 

CO- injected with RNA transcripts encoding an a^, and ^^.^ or ot^j,, 

Qfjb and subunits. This activity was distinguished from Ba^* 

currents detected when oocytes were injected with RNA 

transcripts encoding the jU^.a ± ^2h subimits. These currents 

pharmacologically and biophysically resembled Ca^* currents 

reported for loninjected oocytes. 

Identification and isolation of SNA 
encoding the human calcium chazmel 
BXibunit 

Biological material containing DNA encoding a portion of 
the Qfy^ subunit had been deposited in the American Type Culture 
Collection, 12301 Parklawn Drive, Rockville, Maryland 20852 
U.S.A. under the terms of the Budapest Treaty on the 
International Recognition of Deposits of Microorgcoiisms for 
Purposes of Patent Procedure and the Regulations promulgated 
under this Treaty. San^les of the deposited material are and 
will be available to industrial property offices and other 
persons legally entitled to receive them under the terms of 
the Treaty and Regulations and otherwise in compliance with 
the patent laws and regulations of the Iftiited States of 
America and all other nations or international organizations 
in which this application, or an application claiming priority 
of this application, is filed or in which any patent granted 
on any such application is granted, 

A portion of an a^^ subunit is encoded by an approximately 
3 kb insert in XgtlO phage designated al.254 in E. coli host 
strain NM514. A phage lysate of this material has been 
deposited as at the American Type Culture Collection under 
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ATCC Accession No. 75293, as described above. DNA encoding 
may also be identified by screening with a probe prepared from 
DNA that has SEQ ID No. 21: 

5 ' CTCa^TACCATCTCTOATACCMCCCCA 3 ' . 

ct^ splice variants have been obtained. The sequences of 
two splice variants, ot^^^ and a^.., are set forth in SEQ. ID 
Nos. 22 and 23. Other splice variants may be obtained by 
screening a human library as described above or rising all or 
a portion of the sequences set forth in SEQ ID Nos. 22 and 23. 

Identification and isolation of DNA 
encoding the a„ human calcium channel 
subunit 

DNA encoding the stibunit was isolated by screening a 
human basal ganglia cDNA library with fragments of the rabbit 
skeletal muscle calcium channel subunit -encoding cDNA. A 
portion of one of the positive clones was used to screen an 
IMR32 cell CDNA library. Clones that hybridized to the basal 
ganglia DNA probe were used to further screen an IMR32 cell 
cDNA library to identify overlapping clones that in turn were 
used to screen a human hippocampus cDNA library. In this way, 
a sufficient series of clones to span nearly the entire length 
of the nucleotide sequence encoding the human subunit was 
obtained. Nucleic acid amplification of specific regions of 
the IMR32 cell or^g mRNA yielded additional segments of the a^^ 
coding sequence. 

A full-length a^^ DNA clone was constiructed by ligating 
portions of the partial cDNA clones as described in Bxanqple 
I I.e. SEQ ID Nos. 7 and 8 show the nucleotide sequences of 
DNA clones encoding the a^a subunit as well as the deduced 
amino acid sequences. The a^^ subunit encoded by SEQ ID No. 
7 is referred to as the a^.^^ subunit to distinguish it from 
another a^^ subunit, c^iB-a/ encoded by the nucleotide sequence 
shown as SEQ ID No. 8, which is derived from alternative 
splicing of the subunit transcript. 

Nucleic acid anipli£ication of IMR32 cell mRNA using 
oligonucleotide primers designed according to nucleotide 
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sequences within the or,,., -encoding DNA has identified variants 
of the a« transcript that appear to be splice variants because 
they contain divergent coding sequences. 

ld«at±f±c«tion azid isolation of sha 
•neoding th* a^e huaan calcium chaoaal 
subunit 

Numerous a,c-Bpecific DNA clones were isolated. 
Characterization of the sequence revealed the a,^ coding 
sequence, the a^^ initiation of translation sequence, and an 
alternatively spliced region of a^^. Alternatively spliced 
variants of the a.^ subunit have been identified. SEQ ID No. 
3 sets forth DNA encoding a substantial protion of an a^^ 
subunit. The DNA sequences set forth in SEQ ID No. 4 and No. 
5 encode two possible amino terminal ends of the a^^ protein. 
SEQ ID No. 6 encodes an alternative exon for the IV S3 
transmembrane domain. The sequences of substantial 

portions of two a,^ splice variants, designated a^c-i and a^^.i. 
are set forth in SEQ ID NOs. 3 and 36, respectively. 

The isolation and identification of DNA clones encoding 
portions of the a^^ subunit is described in detail in Example 
II. 

Idttntl£lcation and Isolation of DHA 
encoding the human calcium channel 
subunit 

DNA encoding a„ human calcium channel subunits have been 
isolated from an oligo dT-primed human hippocampus library. 
The resulting clones, which are splice variants, *rere 
designated o(„-i and a^.,. The subunit designated cr«., has the 
amino acid sequence set forth in SBQ ID No. 24, and a subutnit 
designated a^.j has the amino acid sequence set forth in SEQ 
ID No. 25. These splice variants differ by virtue of a 57 base 
pair insert between nucleotides 2405 and 2406 of SEQ. ID No. 
24. 

The of„ subunits provided herein appear to participate in 
the formation of calcium channels that have properties of 
high-voltage activated calcium channels and low-voltage 
activated channels. These channels are rapidly inactivating 
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compared to other high voltage -activated calcium channels. 

m addition these channels exhibit pharmacological profiles 

that are similar to voltage-activated channels, but are also 

sensitive to DHPs and «-Aga-lVA, which block certain high 

voltage activated channels. Additional details regarding the 

electrophysiology and pharmacology of channels containing a„ 

subunits is provided in Excunple VII. F. 

Identification and isolatim of DBA 
encoding encoding additional human 
calcium channel subunlt types and 
subtypes 

DNA encoding additional subunits can be isolated and 

identified using the DNA provided herein as described for the 

<*iA/ ofiBf "ic. "id and a„ subunits or using other methods known 

to those of skill in the art. in particular, the DMA provided 

herein may be used to screen appropriate libraries to isolate 

related DNA. Full-length clones can be constructed using 

methods, such as those described herein, and the resulting 

subunits characterized by comparison of their sequences and 

electrophysiological and pharmacological properties with the 

subimits exenplified herein. 

Identification and Isolation of moi encoding B 
human calcium channel subunits 

DMA encoding 

To isolate DNA encoding the /S^ subunit, a human 
hippocampus cDNA library was screened by hybridization to a 
DNA fragment encoding a rabbit skeletal muscle calcium channel 
0 subimit. A hybridizing clone was selected and was in turn 
used to isolate overlapping clones until the overlapping 
clones encompassing DNA encoding the entire the hximan calcium 
channel j3 subunit were isolated and sequenced. 

Five alternatively spliced forms of the human calcium 
channel /?j subunit have been identified and DNA encoding a 
number of forms have been isolated. These forms are 
designated ^^j, expreseed in skeletal muscle, /J,.j, expressed 
in the CMS, ^^.j, also expressed in the in the CMS, j8,.«, 
expressed in aorta tissue and HEK 293 cells, and 0^.^, 
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expressed in HEK 293 cells. Pull-length DNA clones encoding 
the ^i.j and ^1.3 subunits have been constructed. The subunits 
ft-ii ^1-2 » /?i-4 and ^1.5 have been identified by nucleic acid 
amplification analysis as alternatively spliced forms of the 
p subunit. Sequences of the splice variants are set forth 
in SEQ ID Nos. 9, 10 and 33-35. 
encoding jS, 

DNA encoding the jS^ splice variants has been obtained. 
These splice variants include ^^c'Pik- Splice variants /Sj^-ZS^ 
include all of sequence set forth in SEQ ID No. 26, except for 
the portion at the 5' end (up to nucleotide 182), which 
differs among splice variants. The sequence set forth in SEQ 
ID No. 26 encodes fi^. Additional splice variants may be 
isolated using the methods described herein and 
oligonucleotides including all or portions of the DNA set 
forth in SEQ ID. No. 26 or may be prepared or obtained as 
described in the Bxanples. The sequences of splice 
variants and jffjE are set forth in SEQ id Nos. 37 and 38, 
respectively- 

DNA encoding fi^ 

DNA encoding the subtmit and any splice variants 
thereof may be isolated by screening a library, as described 
above for the jS^ sub\mit, using DNA probes prepared according 
to SEQ ID Nos. 19, 20 or using all or a portion of the 
deposited clone plasmid /?1.42 (ATCC Accession No. 69048). 

The E. coll host containing plasmid pi. 42 that includes 
DNA encoding a subimit has been deposited as ATCC Accession 
No. 69048 in the American Type Culture Collection, 12301 
Parklawn Drive, Rockville, Maryland 20852 U.S.A. under the 
terms of the Budapest Treaty on the International Recognition 
of Deposits of Microorganisms for Purposes of Patent Procedure 
and the Regulations promulgated under this Treaty. Satrples of 
the deposited material are and will be available to industrial 
property offices and other persons legally entitled to receive 
them vinder the terms of the Treaty and Regulations and 
otherwise in compliance with the patent laws and regulations 
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of the united states of America and all other nations or 
mtemational organizations in which this application, or an 
application claiming priority of this application, is filed or 
in which any patent granted on any such application is 
granted. 

The 0, encoding plasmid is designated The plasmid 

contains a 2.5 kb EaoRI fragment encoding /S, inserted into 
vector pGem-7zF(+) and has been deposited in E. coli host 
strain DHSa. The sequences of /S, splice variants, designated 
/Sj.i and are set forth in SEQ id Nos. is and 20, 

respectively. 

Identification and Isolation of DNA eneodina the a2 
human calcium channel subunit » «m 

DNA encoding a human neuronal calcium channel a, subunit 
was isolated in a manner substantially similar to that used 
for Isolating mt, encoding an subunit, except that a human 
genomic una library was probed under low and high stringency 
conditions with a fragment of DNA encoding the rabbit skeletal 
muscle calcium channel a, subunit. The fragment included 
nucleotides having a sequence corresponding to the nucleotide 
sequence between nucleotides 43 and 272 inclusive of rabbit 
back skeletal muscle calcium channel a, subunit cDNA as 
disclosed in PCT International Patent T^lication Publication 
NO. WO 89/09834, which corresponds to U.S. Application Serial 
No. 07/620,520 (now allowed U.S. Application Serial No. 
07/914,231), which is a continuation-in-part of United States 
Serial No. 176,899, filed April 4, 1988. 

Example IV describes the isolation of DNA clones encoding 
subunits of a human calcium channel from a human DNA 
library using genomic DNA and cDNA clones, identified by 
hybridization to the genomic DNA, as probes. 

SEQ ID Nos. 11 and 29-32 show the sequence of DNA 
encoding a, subunits. As described in Example V, nucleic acid 
amplification analysis of RNA from human skeletal muscle, 
brain tissue and aorta using oligonucleotide primers specific 
for a region of the human neuronal a, subunit cDNA that 
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diverges from the rabbit skeletal muscle calcium channel at, 
subunit cDNA identified splice variants of the human calcium 
channel ofj subunit transcript. 

Identification and isolation of im ancoding y 
human calcium channel subunits 

DMA encoding a portion of a human neuronal calcium 
channel y subunit has been isolated as described in detail in 
Example VI. SEQ ID No. 14 shows the nucleotide sequence at 
the 3 '-end of this DNA which includes a reading frame encoding 
a sequence of 43 amino acid residues, since the portion that 
has been obtained is homologous to the rabbit clone, described 
in allowed co-owned U.S. Application Serial No. 07/482,384, 
the remainder of the clone can be obtained using routine 
methods . 

Antibodies 

Antibodies, monocltaial or polyclonal, specific for 
calcium channel subunit subtypes or for calcium channel types 
can be prepared employing standard techniques, known to those 
of skill in the art, using the subunit proteins or portions 
thereof as antigens. Anti-peptide and anti-fusion protein 
antibodies can be used [see, for example, Bahouth et al. 
(1991) Trends Pharmacol. Sci. 12:338-343; Current Protocols in 
Molecular Biology ^Ausubel et al., eds.) John Wiley and Sons, 
New York (1984)]. Factors to consider in selecting portions 
of the calcium channel subunits for use as immunogens (as 
either a synthetic peptide or a recombinantly produced 
bacterial fusion protein) include antigenicity accessibility 
(i.e., extracellular and cytoplasmic domains), uniqueness to 
the particular subunit, and other factors known to those of 
skill in this art. 

The availability of subunit -specific antibodies makes 
possible the application of the technique of 
immunohistochemistry to monitor the distribution and 
expression density of various subunits ( e.g. . in normal vs 
diseased brain tissue) . such antibodies could also be 
employed in diagnostic, such as LBS diagnosis, and therapeutic 
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applications, such as using antibodies that modulate 
activities of calcium channels. 

The antibodies can be administered to a subject employing 
standard methods, such as, for example, by intraperitoneal, 
intramuscular, intravenous, or subcutaneous injection, implant 
or transdermal modes of administration, and the like. One of 
skill in the art can empirically determine dose forms, 
treatment regiments, etc., depending on the mode of 
administration employed. 

Subunit- specific monoclonal antibodies and polyclonal 
antisera have been prepared. The regions from which the 
antigens were identified by conqparing the DNA and amino acid 
sequences of all known a or 0 subunit subtypes. Regions of 
least homology, preferably human-derived sequences were 
selected. The selected regions or fusion proteins containing 
the selected regions are used as immunogens. Hydrophobicity 
analyses of residues in selected protein regions and fusion 
proteins are also performed; regions of high hydrophobicity 
are avoided. Also, and more iinportantly, when preparing 
fusion proteins in bacterial hosts, rare codons are avoided. 
In particular, inclusion of 3 or more successive rare codons 
in a selected host is avoided. Numerous antibodies, 
polyclonal and monoclonal, specific for a or p subunits types 
or subtypes have been prepared; some of these are listed in 
the following Table. Exemplary antibodies and peptide antigens 
used to prepare the antibodies are set forth in the following 
Table: 



TABLE 3 



SPECIFICITY 


AMINO ACID 
NUMBER 


ANTIGEN NAME 


ANTIBODY TYPE j 


al generic 


112-140 


peptide 1A#1 


polyclonal 


ol generic 


1420-1447 


peptide 1A#2 


polyclonal 


alA generic 


1048-1208 


OflA#2(b)GST fusion* 


polyclonal 
monoclonal 


orlB generic 


983-1106 


GflB#2(b) GST fusion 


polyclonal 

monoclonal 
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OflB-l 



2164-2339 



QflB-l#3 GST fusion 



polyclonal 



alB-2 



2164-2237 



0flB-2#4 GST fusion 



polyclonal 



alE generic 



985-1004 
(alE-3) 



alE#2(a) GST fusion 



polyclonal 



* GST gene tusion system is available from Pharmacia; see also. Smith 
^ (1988) Gene £7:31. The system provides pGEX plasmids that are 
designed for inducible, high-level expression of genes or gene 
fragments as fusions with Schistosoma japanicum GST. Vpon expression 
in a bacterial host, the resulting fusion proteins are purified from 
bacterial lysates by affinity chromatography. 



The GST fusion proteins are each specific for the 

cytoplasmic loop region ilS6-IISl, which is a region of low 

subtype homology for all subtypes, including ot^^ and a^^, for 

which similar fusions and zmtisera can be prepaired. 

Preparation of recoimbinant eulcaryotic cells containing mOi 
encoding heterologous calcium channel subunits 

DMA encoding one or more of the calcium channel subunits 

or a portion of a calcium, channel subunit may be introduced 

into a host cell for expression or replication of the DNA. 

Such DNA may be introduced using methods described in the 

following examples or using other procedures well known to 

those skilled in the art. Incorporation of cloned DNA into a 

suitable eaqpression vector, transfection of eukaryotic cells 

with a plasmid vector or a combination of plasmid vectors, 

each encoding one or more distinct genes or with linear DNA, 

and selection of transfected cells are also well known in the 

art [see, e.g., Sambrook et al. (1989) Molecular Cloning: A 

Laboratory Manual, Second Edition, Cold Spring Harbor 

Laboratory Press] . cloned full-length DNA encoding any of 

the subunits of a human calcium channel may be introduced into 

a plasmid vector for expression in a eukaryotic cell. Such 

DNA may be genomic DNA or cDNA. Host cells may be transfected 

with one or a combination of the plasmids, each of which 

encodes at least one calcium channel subunit. Alternatively, 

host cells may be transfected with linear DNA using methods 

well known to those of skill in the art. 

While the DNA provided herein may be e3q>ressed in any 

eukaryotic cell, including yeast cells such as P. pastoris 
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[see, e.g., Cregg et al. (1987) Bio/Tecbnology 5:479], 
manmialian expression systems for expression of the DNA 
encoding the human calcium channel subxmits provided herein 
are preferred. 

The heterologous DNA may be introduced by any method 
known to those of skill in the art, such as transfection with 
a vector encoding the heterologous DNA. Particularly preferred 
vectors for transfection of mammalian cells are the pSV2dhfr 
expression vectors, which contain the SV40 early promoter, 
mouse dhfr gene, SV40 polyadenylation and splice sites and 
sequences necessary for maintaining the vector in bacteria, 
cytomegalovirus (CMV) promoter-based vectors such as pCDNAl, 
or pcDNA-amp and MMTV promoter-based vectors. DNA encoding 
the human calcium channel subunits has been inserted in the 
vector pCDNAl at a position immediately following the ca4V 
promoter. The vector pCDNAl is presently preferred . 

Stably or transiently transfected mammalian cells may be 
prepared by methods known in the art by transfecting cells 
with an expression vector having a selectable marker gene such 
as the gene for thymidine kinase, dihydrofolate reductase, 
neomycin resistance or the like, and, for trauisient 
transfection, growing the transfected cells under conditions 
selective for cells e^qpressing the marker gene. Functional 
voltage -dependent calcium channels have been produced in HEK 
293 cells transfected with a derivative of the vector pCDNAl 
that contains DNA encoding a human calcium channel subunit. 

The heterologous DNA may be maintained in the cell as an 
episomal element or may be integrated into chromosomal DNA of 
the cell. The resulting recombinant cells may then be 
cultured or sxibcultured (or passaged, in the case of mammalian 
cells) from such a culture or a subculture thereof. Methods 
for transfection, injection and culturing recombinant cells 
are known to the skilled artisan. Eukaryotic cells in which 
DNA or RNA may be introduced, include any cells that are 
transf ectable by such DNA or RNA or into which such DNA may be 
injected. Virtually any etikaryotic cell can serve as a 
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vehicle for heterologous DNA. Preferred cells are those that 
can also express the DNA and RNA and most preferred cells are 
those that can form recombinant or heterologous calcium 
channels that include one or more sxibunits encoded by the 
heterologous DNA. Such cells may be identified empirically or 
selected from among those known to be readily transfected or 
injected. Preferred cells for introducing DNA include those 
that can be transiently or stably transfected and include, but 
are not limited to, cells of mammalian origin, such as COS 
cells, mouse L cells, CHO cells, human embryonic kidney cells, 
African green monkey cells and other such cells known to those 
of skill in the art, amphibian cells, such as Xenopus laevxs 
o&cytes, or those of yeast such as Sacch&roayces cereviaiae or 
Pichia pastoris. Preferred cells for e3q>ressing injected RNA 
transcripts or cDNA include Xenopus laevis oocytes. Cells 
that are preferred for transfection of DNA are those that can 
be readily and efficiently transfected. Such cells are known 
to those of skill in the art or may be empirically identified. 
Preferred cells include DG44 cells and HEK 293 cells, 
particularly HEK 293 cells that can be frozen in liquid 
nitrogen and then thawed and regrown. Such HEK 293 cells are 
described, for example in U.S. Patent No. 5,024,939 to Gorman 
[see, also Stillman et al. (1985) Mol. Cell. Biol. 5:2051- 
2060] . 

The cells may be used as vehicles for replicating 
heterologous DNA introduced therein or for expressing the 
heterologous DNA introduced therein. In certain embodiments, 
the cells are used as vehicles for expressing the heterologous 
DNA as a means to produce substantially pure humcUi calcium 
channel subunits or heterologous calcium channels. Host cells 
containing the heterologous DNA may be cultured under 
conditions whereby the calcitam chaimels are expressed. The 
calcium chauinel subunits may be purified using protein 
purification methods known to those of skill in the art. For 
example, antibodies, such as those provided herein, that 
specifically bind to one or more of the subunits may be used 
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for affinity purification of the subunit or calcium channels 
containing the siibunits. 

Substantially pure subunits of a human calcium channel 
(Xi subunits of a human calcium channel, sxibimits of a human 
calcium chaimel, p subunits of a hioman calcium channel and y 
subunits of a human calcium channel are provided . 
Substantially pure isolated calcium channels that contain at 
least one of the human calcium channel subunits are also 
provided. Sxibstantially pure calcium channels that contain a 
mixture of one or more siibunits encoded by the host cell and 
one or more subimits encoded by heterologous DMA or RNA that 
has been introduced into the cell are also provided. 
Substantially pure subtype- or tissue-type specific calcium 
channels are also provided. 

In other embodiments, eukaryotic cells that contain 
heterologous DNA encoding at least one of an subunit of a 
htiman calcium channel , an ©2 subunit of a human calcium 
channel, a /5 subunit of a human calcium chsmnel and a y 
subimit of a human calcium channel are provided. In 
accordance with one preferred embodiment, the heterologous DNA 
is expressed in the eukaryotic cell and preferably encodes a 
human calcium channel subunit* 

Expression of heterologous calcium channels: 
electrophysiology and phaumacology 

Electrophysiological methods for measuring calcium 
channel activity are kwown to those of skill in the art and 
are exemplified herein. Any such methods may be used in order 
to detect the formation of functional calcium channels and to 
characterize the kinetics and other characteristics of the 
resulting currents. Pharmacological studies may be combined 
with the electrophysiological measurements in order to further 
characterize the calcium channels. 

With respect to measurement of the activity of functional 
heterologous calcium channels, preferably, endogenous ion 
channel activity and, if desired, heterologous channel 
activity of chaimels that do not contain the desired stibunits. 
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of a host cell can be inhibited to a significant extent by 
chemical, pharmacological and electrophysiological means, 
including the use of differential holding potential, to 
increase the S/N ratio of the measured heterologous calcium 
channel activity. 

Thus , various combinations of subimits encoded by the DNA 
provided herein are introduced into eukaryotic cells. The 
resulting cells can be examined to ascertain whether 
functional channels are expressed and to determine the 
properties of the channels.. In particularly preferred 
aspects, the eukaryotic cell which contains the heterologous 
DNA esqpresses it and forms a recombinant functional calcium 
channel activity. In more preferred aspects, the recombinant 
calcium cliannel activity is readily detectable because it is 
a type that is absent from the untransf ected host cell or is 
of a magnitude and/or pharmacological properties or exhibits 
biophysical properties not exhibited in the untransfected 
cell. 

The eulcaryotic cells can be transfected with various 
combinations of the subunit subtypes provided herein. The 
resulting cells will provide a uniform population of calcium 
channels for study of calcium channel activity and for use in 
the drug screening assays provided herein. Experiments that 
have been performed have demonstrated the inadequacy of prior 
classification schemes. 

Preferred among transfected cells is a recombinant 
eukaryotic cell with a functional heterologous calcium 
channel . The recombinant cell can be produced by introduction 
of and expression of heterologous DNA or RNA transcripts 
encoding an Ofi subunit of a human calcium channel , more 
preferably also expressing, a heterologous DNA encoding a /S 
subunit of a human calcium channel and/or heterologous DNA 
encoding an subunit of a human calcium channel. Especially 
preferred is the expression in such a recombinant cell of each 
of the ofi, p and ofj sxibunits encoded by such heterologous DNA 
or RNA transcripts, and optionally expression of heterologous 



-44- 



DNA or an RNA transcript encoding a y subunit of a human 
calcium channel. The functional calcium channels may 

preferably include at least an subunit and a subunit of 
a human calcium channel, Eukaryotic cells expressing these 
two subunits and also cells expressing additional subunits, 
have been prepared by transfection of DMA and by injection of 
RNA transcripts. Such cells have exhibited voltage-dependent 
calcium channel activity attributable to calcium channels that 
contain one or more of the heterologous human calcium channel 
subunits. For example, eukaryotic cells expressing 

heterologous calcium channels containing an subunit in 
addition to the subunit and a p subunit have been shown to 
exhibit increased calcium selective ion flow across the 
cellular membrane in response to depolarization, indicating 
that the subunit may potentiate calcium channel fxinction. 
Cells that have been co-transf ected with increasing ratios of 
©2 to Qfj and the activity of the resulting calcium channels has 
been measured. The results indicate that increasing the 
amount of a,-encoding DNA relative to the other transfected 
subunits increases calcium channel activity, 

Eukaryotic cells which express heterologous calcium 
channels containing at least a human subunit, a human 
subunit and a human of^ subimit are preferred. Eukaryotic 
cells transformed with a composition containing cDNA or an RNA 
transcript that encodes an subunit alone or in combination 
with a P and/or an ofa subunit may be used to produce cells 
that express functional calcium channels. Since recombinant 
cells expressing huraem calcium channels containing all of the 
human subunits encoded by the heterologous cDNA or RNA are 
especially preferred, it is desirable to inject or transfect 
such host cells with a sufficient concentration of the 
subunit-encoding nucleic acids to form calcium channels that 
contain the human subunits encoded by heterologous DNA or RNA. 
The precise amounts and ratios of DNA or RNA encoding the 
subunits may be empirically determined and optimized for a 
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particular conODination of subiinits, cells and assay 
conditions . 

In particular, mammalian cells have been transiently and 
stably tranf ected with DNA encoding one or more human calcium 
channel subunits. Such cells e3q)ress heterologous calcium 
channels that exhibit pharmacological and electrophysiological 
properties that can be ascribed to human calcium channels • 
Such cells, however, represent homogeneous populations and the 
pharmacological and electrophysiological data provides 
insights into human calcium channel activity heretofore 
unattainable. For example, HEK cells that have been 
transiently transf ected with DNA encoding the a^^-i, ajb. and /S^.j 
sxibtmits. The resulting cells transiently express these 
subunits, which form calcium channels that have properties 
that appear to be a pharmacologically distinct class of 
voltage-activated calcium channels distinct from those of 

T" and P-type channels* The observed a^g currents were 
insensitive to drugs and toxins previously used to define 
other classes of voltage -activated calcium chamiels- 

HEK cells that have been trauisf iently transfected with 
DNA encoding a^^.:^, a^h' and p^,^ express heterologous calcium 
channels that exhibt sensitivity to w-conotoxin and currents 
typical of N-type channels. It has been found that alteration 
of the molar raios of a^^.^, ot^ and jS^.^ introduced into the 
cells into to achieve equivalent mRNA levels significantly 
increased the number of receptors per cell, the current 
density, and affected the for w-conotoxin. 

The electrophyiological properties of these channels 
produced from a^^.j, Ofj^, and /S^.j was compared with those of 
channels produced by transiently transfecting HEK cells with 
DNA encoding a^^.^, of^b and /S^.j, The channels exhibited similar 
voltage dependence of activation, substantially identical 
voltage dependence, similar kinetics of activation and tail 
cxirrents that could be fit by a single exponential. The 
voltage dependence of the kinetics of inactivation was 
significantly different at all voltages examined. 



-46- 



In certain embodiments, the eukaryotic cell with a 
heterologous calcium channel is produced by introducing into 
the cell a first composition, which contains at least one rna 
transcript that is translated in the cell into a subunit of a 
human calcium channel, in preferred embodiments, the subunits 
that are translated include an a, subunit of a human calcium 
channel. More preferably, the composition that is introduced 
contains an RNA transcript which encodes an ot, subunit of a 
human calcium channel and also contains (i) an RNA transcript 
v&ich encodes a /? subunit of a human calcium channel and/or 
(2) an RNA transcript which encodes an a, subunit of a human 
calcium channel. Especially preferred is the introduction of 
RNA encoding an or,, a /5 and an a, human calcium channel 
subunit. and, optionally, a y subunit of a human calcium 
channel . Methods for in vitro transcription of a cloned 

DNA and injection of the resulting RNA into eukaryotic cells 
are well known in the art. Transcripts of any of the full- 
length DNA encoding any of the subunits of a human calcium 
channel may be injected alone or in co«4>ination with other 
transcripts into eukaryotic cells for expression in the cells 
Amphibian oficytes are particularly preferred for expression of 
in vitro transcripts of the human calcium channel subunit cDNA 
clones provided herein. Amphibian oocytes that express 
fimctional heterologous calcium channels have been produced by 
this method. 

Assays and Clinical uses of the cells and calcium channels 
Assays 

Assays for Identifying cospounds that 
modulate caleitaoi chazmel activity 

Among the uses for eukaryotic cells which recombinantly 

express one or more subunits are assays for determining 

whether a test compound has calcium channel agonist or 

antagonist activity. These eukaryotic cells may also be used 

to select from among known calcium channel agonists and 

antagonists those exhibiting a particular calcium channel 
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subtype specificity and to thereby select compoiinds that have 
potential as disease- or tissue-specific therapeutic agents. 

In vitro methods for identifying compounds, such as 
calcium channel agonist and antagonists, that modulate the 
activity of calcium channels using eukaryotic cells that 
exprsBe heterologous human calcium channels are provided. 

In particular, the assays use eukaryotic cellB that 
express heterologous human calcium channel subunits encoded by 
heterologous DNA provided herein, for screening potential 
calcium channel agonists and antagonists which are specific 
for human calcium channels and particularly for screening for 
compounds that are specific for particular human calcium 
channel subtypes. Such assays may be used in conjunction with 
methods of rational drug design to select among agonists and 
antagonists, which differ slightly in structure, those 
particularly useful for modulating the activity of human 
calcium channels, and to design or select confounds that 
exhibit subtype- or tissue- specific calcium channel 
antagonist and agonist activities. These assays should 
accurately predict the relative therapeutic efficacy of a 
compound for the treatment of certain disorders in humans. In 
addition, since subtype-and tissue-specific calcium channel 
subunits are provided, cells with tissue- specific or subtype- 
specific recombinant calcium channels may be prepared and used 
in assays for identification of human calcium channel tissue - 
or subtype-specific drugs. 

Desirably, the host cell for the e3q>ression of calcium 
channel subimits does not produce endogenous calcium channel 
subunits of the type or in an amount that substantially 
interferes with the detection of heterologous calcium channel 
subunits in ligand binding assays or detection of heterologous 
calcium channel function, such as generation of calcium 
current, in functional assays. Also, the host cells 
preferably should not produce endogenous calcium channels 
which detectably interact with compounds having, at 
physiological concentrations (generally nanomolar or picomolar 



-48- 



concentrations), affinity for calcium channels that contain 
one or all of the human calcium channel subunits provided 
herein . 

With respect to ligand binding assays for identifying a 
compound which has affinity for calcium channels, cells are 
employed which express, preferably, at least a heterologous 
subunit. Transf acted eukaryotic cells which eacpress at least 
an subunit may be used to determine the ability of a test 
compound to specifically bind to heterologous calcium channels 
by, for example, evaluating the ability of the test coB5>ound 
to inhibit the interaction of a labeled compound known to 
specifically interact with calcium channels. Such ligand 
binding assays may be performed on intact transfected cells or 
membranes prepared therefrom. 

The capacity of a test compound to bind to or otherwise 
interact with membranes that contain heterologous calcium 
channels or subunits thereof may be determined by using any 
appropriate method, such as competitive binding analysis, such 
as Scat chard plots, in which the binding capacity of such 
membranes is determined in the presence and absence of one or 
more concentrations of a compound having known affinity for 
the calcium channel. Where necessary, the results may be 
compared to a control experiment designed in accordance with 
methods known to those of skill in the art. For example, as 
a negative control, the results may be conpared to those of 
assays of an identically treated membrane preparation from 
host cells which have not been transfected with cne or more 
subunit-encoding nucleic acids. 

The assays involve contacting the cell membrane of a 
recombinant eukaryotic cell which expresses at least one 
subunit of a human calcium channel, preferably at least an a, 
subunit of a human calcium channel, with a test compound and 
measuring the ability of the test compound to specifically 
bind to the membrane or alter or modulate the activity of a 
heterologous calcium channel on the membrane. 
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In preferred embodiments, the assay uses a recombinant 
cell that has a calcium channel containing an stibunit of a 
human calcium channel in combination with a p subunit of a 
human calci™ channel and/or an ofj subunit of a human calcium 
channel. Recombinant cells expressing heterologous calcium 
channels containing each of the a^, p and ctj human siibunits, 
and, optionally, a y subomit of a human calcium channel are 
especially preferred for use in such assays. 

In certain embodiments, the assays for identifying 
compounds that modulate calcium channel activity are practiced 
by measuring the calcium channel activity of a eukaryotic cell 
having a heterologous, functional calcium channel when such 
cell is esqposed to a solution containing the test compound and 
a calcium channel -selective ion and comparing the measured 
calcium channel activity to the calcium channel activity of 
the same cell or a substantially identical control cell in a 
solution not containing the test compoiHid. The cell is 
maintained in a solution having a concentration of calcium 
channel -selective ions sufficient to provide an inward current 
when the channels open. Rcombinant cells expressing calcium 
channels that include each of the a^, P and human subunit s, 
and, optionally, a y subunit of a human calcium channel, are 
especially preferred for use in such assays. Methods for 
practicing such assays are known to those of skill in the art. 
For example, for similcu: methods applied with Xenopus laevis 
oocytes and acetylcholine receptors, see, Mishina et al. 
1(1985) Nature 313:364] and, with such oocytes and sodium 
channels [see, Noda et ai. (1986) Nature 322:826-828]. For 
similar studies which have been carried out with the 
acetylcholine receptor, see, e.g., Claudio et al. [(1987) 
Science 238:1688-1694], 

Pimctional recombinant or heterologous calcium channels 
may be identified by any method known to those of skill in the 
art. For example, electrophysiological procedures for 
measuring the ciirrent across ctn ion-selective membrane of a 
cell, which are well ]cnown, may be used. The amoimt and 
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duration of the flow of calcium-selective ions through 
heterologous calcium channels of a recombinant cell containing 
DNA encoding one or more of the sxibunits provided herein has 
been measured using electrophysiological recordings using a 
two electrode and the whole-cell patch clamp techniques. m 
order to inqprove the sensitivity of the assays, known methods 
can be used to eliminate or reduce non- calcium currents and 
calcium currents resulting from endogenous calcium channels, 
when measuring calcium currents through recombinant channels. 

For example, the DHP Bay K 8644 specifically enhances L-type 
calcium channel function by increasing the duration of the 
open state of the channels [see, e.g., Hess, J.B., et al. 
(1984) Nature 312:538-5441 . Prolonged opening of the channels 
results in calcium currents of increased magnitude and 
duration. Tail currents can be observed upon repolarization 
of the cell membrane after activation of ion channels by a 
depolarizing voltage command. The opened channels require a 
finite time to close or "deactivate" upon repolarization, and 
the current that flows through the channels during this period 
is referred to as a tail current. Because Bay K 8644 prolongs 
opening events in calcium channels, it tends to prolong these 
tail currents and make them more pronoimced. 

In practicing these assays, stably or transiently 
transfected cells or injected cells that egress voltage- 
dependent human calcium channels containing one or more of the 
subunits of a human calcium channel desirably may be used in 
assays to identify agents, such as calcium channel agonists 
and antagonists, that modulate calcium channel activity. 
Functionally testing the activity of test compounds, including 
compounds having unlaiown activity, for calcium channel agonist 
or antagonist activity to determine if the test compound 
potentiates, inhibits or otherwise alters the flow of calcium 
ions or other ions through a human calcium cheumel can be 
accomplished by (a) maintaining a eukaryotic cell which is 
transfected or injected to express a heterologous functional 
calcium channel capable of regulating the flow of calcium 
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channel -selective ions into the cell in a medium containing 
calcium channel -selective ions (i) in the presence of and (ii) 
in the absence of a test compound; (b) maintaining the cell 
under conditions such that the heterologous calcium chaimels 
are substantially closed and endogenous calcium channels of 
the cell are substantially inhibited (c) depolarizing the. 
membrane of the cell maintained in step (b) to an extent and 
for an amount of time sufficient to cause (preferably, 
substantially only) the heterologous calcium channels to 
become permeable to the calcium channel-selective ions; and 
(d) comparing the amount and duration of current flow into the 
cell in the presence of the test compound to that of the 
current flow into the cell, or a substantially similar cell, 
in the absence of the test confound. 

The assays thus use cells, provided herein, that es^ress 
heterologous functional calcium channels and measure 
functionally, such as electrophysiologically, the ability of 
a test compound to potentiate, antagonize or otherwise 
modulate the magnitude and duration of the flow of calcium 
channel -selective ions, such as Ca** or Ba**, through the 
heterologous fimctional channel. The amount of current which 
flows through the recombinant calcium channels of a cell may 
be determined directly, such as electrophysiologically, or by 
monitoring an independent reaction which occurs 
intracellularly and which is directly influenced in a calcium 
(or other) ion dependent manner. Any method for assessing 

the activity of a calcium channel may be used in conjunction 
with the cells and assays provided herein. For example, 

in one embodiment of the method for testing a compound for its 
ability to modulate calcium channel activity, the amount of 
current is measured by its modulation of a reaction which is 
sensitive to calcium channel-selective ions and uses a 
eukaryotic cell which expresses a heterologous calcium channel 
and also contains a transcriptional control element 
operatively linked for expression to a structural gene that 
encodes an indicator protein. The transcriptional control 
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element used for transcription of the indicator gene is 
responsive in the cell to a calcium channel-selective ion, 
such as Ca^* and Ba*. The details of such transcriptional 
based assays are described in commonly owned PCX International 
Patent Application No. PCT/US91/5625, filed August 7, 1991, 
which claims priority to copending commonly owned allowed U.S. 
Application Serial No. 07/563,751, filed August 1. 1990; see 
also, commonly owned published PCT International Patent 
Application PCT US92/11090, which corresponds to co-pending 
U.S. Applications Serial Nos. 08/229,150 and 08/244,985. 
Assaye for diagnosis of IiES 
LES is an autoimmune disease characterized by an 
insufficient release of acetylcholine from motor nerve 
terminals which noarmally are responsive to nerve impulses. 
Immunoglobulins (IgG) from LBS patients block individual 
voltage -dependent calcium channels and thus inhibit calcium 
channel activity [Kim and Neher, Science 239:405-408 (1988)], 
A diagnostic assay for Lambert Eaton Syndrome (LES) is 
provided herein. The diagnostic assay for LES relies on the 
immunological reactivity of LES IgG with the human calcium 
channels or particular subunits alone or in combination or 
expressed on the surface of recombinant cells. For example, 
such an assay may be based on iramunoprecipitation of LES IgG 
by the human calcium channel subunits and cells that express 
such subunits provided herein. 
Clinical applicationB 

In relation to therapeutic treatment of various disease 
states, the availability of DNA encoding hioman calcium channel 
subunits permits identification of any alterations in such 
genes (e.g. . mutations) which may correlate with the 
occurrence of certain disease states. In addition, the 
creation of animal models of such disease states becomes 
possible, by epecifically introducing such mutations into 
synthetic DNA fragments can then be introduced into laboratory 
animals or in vitro assay systems to determine the effects 
thereof . 
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Also, genetic screening can be carried out using the 
nucleotide sequences as probes. Thus, nucleic acid samples 
from subjects having pathological conditions suspected of 
involving alteration/modification of any one or more of the 
calcium channel subunits can be screened with appropriate 
probes to determine if any abnormalities exist with respect to 
any of the endogenous calcium channels. Similarly, subjects 
having a family history of disease states related to calcium 
channel dysfunction can be screened to determine if they are 
also predisposed to such disease states. 

EXAMPLES 

The following examples are ' included for illustrative 
purposes only and are not intended to limit the scope of the 
invention, 

EXAMPLE I: PREPARATIOM OF LIBRARIES USED FOR ISOLATION 

OF IXNA SECODING HUMAN NEURONAL VOLTAGE- 
DEPENDENT CALCIUM CHANNEL SDBXINITS 
A. RNA Isolation 

1* IMR32 cells 
IMR32 cells were obtained from the American Type Culture 
Collection (ATCC Accession No. CCL127, Rockville, MD) and 
grown in DMEM, 10% fetal bovine serum, i% 
penicillin/streptomycin (GIBCO, Grand Island, NY) plus 1,0 mM 
dibutyryl cAMP (dbcAMP) for ten days. Total RNA was isolated 
from the cells according to the procedure described by H.C. 
Bimboim [(1988) Nucleic Acids Research 16:1487-1497]. 
Poly (A*) RNA was selected according to standard procedures 
[see, e.g., Sambrook et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press; pg, 
7. 26-7,29] . 

2. Human thalaimis tissue 

Human thalamus tissue (2.34 g) , obtained from the 
National Neurological Research Bank, Los Angeles, CA, that had 
been stored frozen at -70*>C was pulverized using a mortar and 
pestle in the presence of liquid nitrogen and the cells were 
lysed in 12 ml of lysis buffer (5 m guanidinium 
isothiocyanate, 50 mM TRIS, pH 7.4, 10 mM EDTA, 5% /8- 
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mercaptoethanol) . Lysis buffer was added to the lysate to 
yield a final volume of 17 ml. N-laurylsarcosine and Cscl 
were added to the mixture to yield final concentrations of 4% 
and 0.01 g/ml, respectively, in a final volume of is ml. 

The sample was centrifuged at 9,000 rpm in a Sorvall SS34 
rotor for lo min at room temperature to remove the insoluble 
material as a pellet. The supernatant was divided into two 
equal portions and each was layered onto a 2 -ml cushion of a 
solution of 5,7 M CsCl, 0.1 M EDTA contained in separate 
centrifuge tubes to yield approximately 9 ml per tube. The 
samples were centrifuged in an SW41 rotor at 37,000 rpm for 24 
h at 20"C. 

After centrifugation, each RNA pellet was resuspended in 
3 ml ETS (10 mM TRIS, pH 7.4, 10 mM EDTA, 0.2% SDS) and 
combined into a single tube. The RNA was precipitated with 
0.25 M NaCl and two voltunes of 95% etheuiol. 

The precipitate was collected by centrifugation and 
resuspended in 4 ml PK buffer (0.05 M TRis, pH 8.4, o.i4 m 
NaCl, 0.01 M BOTA, 1% SDS). Proteinase K was added to the 
sainple to a final concentration of 200 /ig/ml. The sample was 
incubated at 22 »c for l h, followed by extraction with an 
equal volume of phenol: chloroform risoamylalcohol (50:48:2) two 
times, followed by one extraction with an equal volume of 
chloroform: isoamylalcohol (24:1). The RNA was precipitated 
with ethanol and NaCl. The precipitate was resuspended in 400 
Hi of ETS buffer. The yield of total RNA was approximately 
1.0 mg. Poly A* RNA (30 ^g) was isolated from the total rna 
according to standard methods as stated in Exanple I.A.l. 
B. Library Construction 

Double-stranded cDNA was synthesized according to 
standard methods [see, e.g., Sarabrook et al. (1989) IN: 
Molecular dealing, A Laboratory Manual. Cold Spring Harbor 
Laboratory Press, Chapter 8] . Each library was prepared in 
sxibstantially the same manner except for differences in: 
1) the oligonucleotide used to prime the first strand cDNA 
synthesis, 2) the adapters that were attached to the double- 
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stranded cDNA, 3) the method used to remove the free or xinused 
adapters, and 4) the size of the fractionated cDNA ligated 
into the X phage vector. 

1. IMR32 C23NA library #1 
Single -stranded cDNA was synthesized using IMR32 poly (A*) 
RNA (Example I.A.I.) as a template and was primed using oligo 
(^^) 12-18 (Collaborative Research Inc., Bedford, MA); The 
single-stranded cDNA was converted to double -stranded cDNA and 
the yield was approximately 2/ig. Ecol adapters: 

5'-AATTCGGTACGTACACTCGAGC-3' = 22-mer (SEQ ID No. 15) 
3'- GCCATGCATGTGAGCTCG-5' = 18-mer (SEQ ID No. 16) 
also containing StoaBI and Xhol restriction sites were then 
added to the double-stranded cDNA according to the following 
procedure . 

a. Phosphorylation of 18-mer 

The 18-mer was phosphorylated using standard methods 
[see, e^g. , Sambrook et al. (1989) IN: Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Oiapter 8] by combining in a 10 (xl total volume the 18-mer 
(225 pmoles) with ["Pl-y-ATP (7000 Ci/mmole; 1,0 /il) and kinase 
(2 U) and incubating at 37** C for 15 minutes. After 
incubation, 1 /zl 10 mM ATP and an additional 2 U of kinase 
were added and incubated at 37*»C for 15 minutes. 
Kinase was then inactivated by boiling for 10 minutes. 

b. Hybridization of 22-iiier 

The 22-mer was hybridized to the phosphorylated 18-mer 
by addition of 225 pmoles of the 22-mer (plus water to bring 
volume to 15 /il) , and incxibation at 65 *C for 5 minutes- The 
reaction was then allowed to slow cool to room temperature. 

The adapters were thus present at a concentration of 15 
pmoles//zl, and were ready for cDNA-adapter ligation. 

c. Ligation of adapters to emth 

After the EcoRI, SlnaBI, Xhol adapters were ligated to the 
double-stranded cDNA using a standard protocol [see, e.g., 
Sambrook et al. (1989) IN: Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, Chapter 8] , the 
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ligase was inactivated by heating the mixture to 72«C for 15 
minutes. The following reagents were added to the cDNA 
ligation reaction and heated at 37°C for 30 minutes: cDNA 
ligation reaction (20 ^1} , water (24 fil) , lOx kinase buffer (3 
Ml)/ 10 IBM ATP (1 111) and kinase {2^1 of 2 U/^l) . The 
reaction was stopped by the addition of 2 fil 0.5M EDTA, 
followed by one phenol /chloroform extraction and one 
chloroform extraction. 

d. Size Selection and Packaging of cDNA 
The double- stranded cDNA with the £coRI, SnaBI, xhol 
adapters ligated was purified away from the free or iinligated 
adapters using a 5 ml Sepharose CL-4B column (Sigma, St. 
Louis, MO) . 100 fil fractions were collected and those 
containing the cDNA, determined by monitoring the 
radioactivity, were pooled, ethanol precipitated, resuspended 
in TE buffer and loaded onto a 1% agarose gel* After the 
electrophoresis, the gel was stained with ethidium bromide and 
the 1 to 3 kb fraction was cut from the gel. The cDNA 
embedded in the agarose was eluted using the "Geneluter 
Electroelution System" (Invitrogen, San Diego, CA) . The 
eluted CDNA was collected by ethanol precipitation and 
resuspended in TE buffer at 0.10 pmol/^l. The cDNA was 
ligated to 1 /zg of EcoRl digested, dephosphorylated Xgtll in 
a 5 /il reaction volume at a 2- to 4- fold molar excess ratio 
of CDNA over the Xgtll vector. The ligated Xgtll containing 
the cDNA insert was packaged into X phage virions in vitro 
using the Gigapack (Stratagene, La Jolla, CA) kit. The 
packaged phage were plated on an E. coli Y1088 bacterial lawn 
in preparation for screening. 

2. IMR32 cDNA library #2 
This library was prepared as described (Example I.B.I.) 
with the exception that 3 to 9 kb cDNA fragments were ligated 
into the Xgtll phage vector rather than the i to 3 kb 
fragments . 
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3. IMR32 cDNA library #3 
IMR32 cell poly (A*) RNA (Example I.A.I.) was used as a 
template to synthesize single- stranded cDNA. The primers for 
the first strand cDNA synthesis were random primers 
(hexadeoxy-nucleotides [pd{N),] Cat #5020-1. Clontech. Palo 
Alto, CA) . The double-stranded cDNA was synthesized, EcoRI, 
SnaBI, Xhol adapters were added to the cDNA, the unligated 
adapters were removed, and the double -stranded cDNA with the 
ligated adapters was fractionated on an agarose gel, as 
described in Example I.B.l. The cDNA fraction greater than 
1.8 kb was eluted from the agarose, ligated into Xgtll, 
packaged, and plated into a bacterial lawn of Y1088 (as 
described in Example I.B.l.). 

4. IMR32 CDNA library *4 
IMR32 cell poly (A*) RNA (Example I.A.I.) was used as a 
template to synthesize single-stranded cDNA. The primers for 
the first strand cDNA synthesis were oligonucleotides: 89-365a 
specific for the a„ (VDCC III) type a,-subunit (see Example 
II. A.) coding sequence (the con?>leraentary sequence of nt 2927 
to 2956, SEQ ID No. 1), 89-495 specific for the a.^ (VDCC II) 
type a,-subunit (see Example li.B.) coding sequence (the 
complementary sequence of nt 852 to 873, SEQ ID No. 3), and 
90-12 specific for the a.^-sxibjmit coding sequence ' (the 
complementary sequence of nt 2496 to 2520, SEQ ID No. 3) . The 
CDNA library was then constructed as described (Example 
I.B.3), except that the cDNA size-fraction greater than 1.5 kb 
was eluted from the agarose rather than the greater than 
1.8 kb fraction. 

5. IMR32 COMA library #5 

The CDNA library was constructed as described (Example 
I.B.3.) with the exception that the size -fraction greater than 
1.2 kb was eluted from the agarose rather than the greater 
than 1.8 kb fraction. 

6. Human thalamus ttaxh library #S 

Human thalamus poly (A*) RNA (Example I. A. 2.) was used as 
a template to synthesize single-stranded cDNA. Oligo (dT) was 
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used to prime the first strand synthesis (Example I.B.I.), 
The double-stranded cDNA was synthesized (Example I.B.I.) and 
EcoRI, Kpnl, Ncol adapters of the following sequence: 
5' CCATGGTACCTTCGTTGACG 3'= 20-mer (SEQ ID NO. 17) 
3' GGTACCATGGaAGCAACroCTTAA 5'« 24-mer (SEQ ID NO. 18) 
were ligated to the double- stranded cDNA as described (Exan5)le 
I.B.i.) with the 20-mer replacing the 18-mer and the 24-mer 
replacing the 22-mer. The unligated adapters were removed by 
passing the cDNA-adapter mixture through a 1 ml Bio Gel A- 50 
(Bio-Rad Laboratories, Richmond, CA) column. Fractions (30 
111) were collected and l ^1 of each fraction in the first peak 
of radioactivity was electrpphoresed on a 1% agarose gel. 
After electrophoresis, the gel was dried on a vacuum gel drier 
and exposed to x-ray film. The fractions containing cDNA 
fragments greater than 600 bp were pooled, ethanol 
precipitated, and ligated into Xgtll (Example I.B.I.) . The 
construction of the cDNA library was completed as described 
(Example I.B.I.) . 

C. Bybridizatlon and Washing Conditions 
Hybridization of radiolabelled nucleic acids to 
immobilized DNA for the purpose of screening cDNA libraries, 
DNA Southern transfers, or northern transfers was routinely 
performed in standard hybridization conditions 
[hybridization: 50% deionized formamide, 200 /xg/ml sonicated 
herring sperm DNA (Cat #223646, Boehringer Mannheim 
Biochemicals, Indianapolis, IN), 5 x SSPE, 5 x Denhardt's, 42 
C; wash :0.2 x SSPE, 0.1% SDS, eS'^ C] . The recipes for SSPE 
and Denhardt's and the preparation of deionized formamide are 
described, for example, in Sambrook et ai. (1989) Afolecuiar 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory 
Press, Chapter 8) • In some hybridizations, lower stringency 
conditions were used in that 10% deionized formamide replaced 
50% deionized formamide described for the standard 
hybridization conditions. 
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The washing conditions for removing the non-specific 
probe from the filters was either high, medium, or low 
stringency as described below: 

1) high stringency: o.l x SSPE, o.l* SDS. ss-c 

2) medium stringency: 0.2 x SSPE, 0.1% SDS. 50«C 

3) low stringency: l.o x SSPE, 0.1* SDS, 50<»C 

It is understood that equivalent stringencies may be achieved 
using alternative buffers, salts and temperatures 

A. Isolation o£ m encoding the a„ subunit 

1. Reference list of partial a„ cDlA clones 

Numerous Q(,o-specific cDNA clones were isolated in order 
to characterize the complete coding sequence plus portions 
of the 5' and 3' untranslated sequences. SEQ ID No. l shows 
the complete ot„ dna coding sequence, plus 510 nucleotides of 
5' untranslated sequence ending in the guanidine nucleotide 
adjacent to the adenine nucleotide of the proposed initiation 
Of translation as well as 642 nucleotides of 3' untranslated 
sequence. Also shown in SEQ ID No. i is the deduced amino 
acid sequence. a list of partial cDNA clones used to 
characterize the a„ sequence and the nucleotide position of 
each Clone relative to the full-length a„ cDNA sequence, which 
xs set forth in SEQ ID No. 1. is shown below. The isolation 
and characterization of these clones are described below 
(Example II.a.2. ) . 

IMR32 1.144 nt 1 to 510 of SEQ ID No. 1 

5' untranslated sequence, 

nt 511 to 2431, SEQ ID No. 1 

IMR32* 1.136 nt 1627 to 2988, SEQ ID No. 1 

nt 1 to 104 of SEQ ID No. 2 

additional exon, 
1MR32® 1.80 nt 2083 to 6468, SEQ ID No. 1 

IMR32* 1.36 nt 2857 to 4281, SEQ ID No. 1 

IMR32 1.163 nt 5200 to 7635, SEQ ID No. 1 
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♦ 5' of nt 1627, IMR32 1.136 encodes an intron and an 
additional axon described in Example II.A.2.d. 

® IMR32 1,80 contains two deletions, nt 2984 to 3131 
and nt 5303 to 5349 (SEQ ID No. 1) . The 148 nt 
deletion (nt 2984 to 3131) was corrected by 
performing a polymerase chain reaction described in 
Example II.A.3.b. 
# IMR32 1-36 contains a 132 nt deletion (nt 3081 to 
3212) . 

2. Isolation and characterization of individual 
clones listed in Bxanple ZI.A.l. 

a. IMR32 1.36 

Two million recombinants of the IMR32 cDNA library #1 
(Example I.B.I.) were screened in duplicate at a density of 
approximately 200,000 plaques per 150 mm plate using a mixture 
of radiolabelled fragments of the coding region of the rabbit 
skeletal muscle calcium channel cDNA [for the sequence of 
the rabbit skeletal muscle calcitim channel a^^ subunit cDNA, 
see, Tanabe et al. (1987). Nature 32^:313- 

318] : Fragment Nucleotides 

JCpnl-EcoRI -78 to 1006 

£coRI-JChoI 1006 to 2653 

Apal-Apal 3093 to 4182 

Bgrlll-Saci 4487 to 5310 

The hybridization uras performed using low stringency 
hybridization conditions (Example I.e.) and the filters were 
washed under low stringency (Example I.e.). Only one a^- 
specific recombinant {IMR32 1.36) of the 2 x 10* screened was 
identified. IMR32 1.36 was plaque purified by standard 
methods (J. Sambrook et al. (1989) Molecular cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Chapter 8) subcloned into pGEM3 (Promega, Madison, WI) and 
characterized by DNA sequencing. 

b. IHR32 1.80 
Approximately 1 x 10^ recombinants of the IMR32 cDNA 
library #2 (Example I.B.2.) were screened in duplicate at a 
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denslty of approximately 100,000 plaques per 150 mm plate 
using the IMR32 1.36 cDNA fragment (Example II. A. 1) as a 
probe. Standard hybridization conditions were used, and the 
filters were washed under high stringency (Example i.e.). 
Ihree positive plaques were identified one of which was IMR32 
1.80. IMR32 1.80 was plaque purified by standard methods, 
restriction mapped, subcloned, and characterized by DKA 
sequencing . 

c. IMR32 1.144 

Approximately l x 10« recombinants of the IMR32 cDNA 
library #3 (Example I.B.3) were screened with the £coRl-Pvull 
fragment (nt 2083 to 2518. SEQ ID No. l) of IMR32 1.80. The 
hybridization was performed using standard hybridization 
conditions (Example i.e.) and the filters were washed under 
high stringency (Example I.e.). Three positive plaques were 
identified one of which was IMR32 1.144. IMR32 1.144 was 
plaque purified, restriction mapped, and the cDNA insert was 
subcloned into pGEM7Z (Promega, Madison, Wl) and characterized 
by DNA sequencing. This characterization revealed that IMR32 
1.144 has a series of ATG codons encoding seven possible 
initiating methionines (nt 511 to 531, SEQ ID No. l) . Nucleic 
acid amplification analysis, and DNA sequencing of cloned 
nucleic acid amplification analysis products encoding these 
seven ATG codons confirmed that this sequence is present in 
the transcript eaqiressed in dbcAMP- induced IMR32 cells, 
d. I11R32 1.136 
Approximately i x 10* recombinants of the IMR32 cDNA 
library #4 (Example I.B.4) were screened with the BcoRI-PvuII 
fragment (nt 2083 to 2518, SEQ ID No. l) of 1MR32 1.80 
(Example II.A.l.) . The hybridization was performed using 
standard hybridization conditions (Example i.e.) and the 
filters were washed under high stringency (Example I.e.) . six 
positive plaques were identified one of which was IMR32 1.136. 
IMR32 1.136 was plaque purified, restriction mapped, and the 
CDNA insert was subcloned into a standard plasmid vector, 
PSP72 (Promega, Madison, wi.), and characterized by DNA 
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sequencing. This characterization revealed that IMR32 1.136 
encodes an incompletely spliced a^^ transcript. The clone 
contains nucleotides 1627 to 2988 of SEQ id No. l preceded by 
an approximate 640 bp intron. This intron is then preceded by 
a 104 nt exon (SBQ ID No. 2) which is an alternative exon 
encoding the IS6 transmembrane domain [see, e.g., Tanabe et 
al. (1987) J«ature 325:313-318 for a description of the ISl to 
IVS6 transmembrane terminology] of the a^^ subunit and can 
replace nt 1627 to 1730, SEQ ID No. 1, to produce a completely 
spliced ofu, transcript. 

e. I1IR32 1.163 
Approximately 1 x 10« recombinants of the IMR32 cDNA 
library #3 (Example I.B.3.) were screened with the Ncol-Xbol 
fragment of IMR32 1.80 (Example Il.A.i.) containing nt 5811 to 
6468 (SEQ ID No. 1). The hybridization was performed using 
standard hybridization conditions (Example I.e.) and the 
filters were washed under high stringency (Exanple i.e.). 
Three positive plaques were identified one of ^rtiich was IMR32 
1.163. IMR32 1.163 was plaque purified, restriction mapped, 
and the cDNA insert was subcloned into a standard plasmid 
vector, PSP72 (Promega, Madison, Wl.), and characterized by 
DNA sequencing. This characterization revealed that IMR32 
1.163 contains the ar„ termination codon, nt 6994 to 6996 (SEQ 
ID No. 1) . 

^* * full-length cDNA 

[pVDCCIII (A) ] 

OfiB CDNA clones IMR32 1.144, IMR32 1.136, IMR32 1.80, and 
IMR32 1.163 (Example II. A. 2.) overlap and include the entire 
o„ coding sequence, nt 511 to 6993 (SEQ ID No. 1), with the 
exception of a 148 bp deletion, nt 2984 to 3131 (SEQ ID No. 
1) . Portions of these partial cDMA clones were ligated to 
generate a full-length o^, cDKA in a eukaryotlc expression 
vector. The resulting vector was called pVDeciIl(A). The 
construction of pVDCCIll (A) was performed in four steps 
described in detail below: (1) the construction of 
pVDCCIll/5' using portions of IMR32 1.144, IMR32 1.136, and 
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IMR32 1.80, (2) the construction of pVDCCIII/5' .3 that 
corrects the 148 nt deletion in the IMR32 1.80 liortion of 
pVDCCIII/5', (3) the construction of pVDCCIII/3'.i using 
portions of IMR32 1.80 and IMR32 1.163, and (4) the ligation 
of a portion of the pVDCCIli/5' .3 insert, the insert of 
pVDCCIII/3'.l, andpcDNAl (Invitrogen, San Diego, CA) to form 
pVDcciii (A) . The vector pcDNAi is a eukaryotic expression 
vector containing a cytomegalovirus (CMV) promoter which is a 
constitutive promoter recognized by mammalian host cell RNA 
polymerase II. 

Each of the DNA fragments used in preparing the full- 
length construct was purified by electrophoresis through an 
agarose gel onto DE81 filter paper (Whatman, Clifton. Nj) and 
elution from the filter paper using 1.0 M NaCl, 10 mM TRIS, pH 
8.0, 1 mM EDTA. The ligations typically were performed in a 
10 Hi reaction volume with an equal molar ratio of insert 
fragment and a two- fold molar excess of the total insert 
relative to the vector. The amount of DNA used was normally 
about 50 ng to 100 ng. 

a. pVDCCXII/5' 

To construct pVDCCIII/5', IMR32 1.144 (Example II.A.2.C.) 
was digested with Xhol and BcoRI and the fragment containing 
the vector (pGEM7Z) , a„ nt 1 to 510 {SEQ ID No. 1) , and nt 
511 to 1732 (SEQ ID No. 1) was isolated by gel 
electrophoresis. The ScoRI-Apal fragment of IMR32 1.136 
(Example II.A.2.d.) nucleotides 1733 to 2671 (SEQ ID No. 1) 
was isolated, and the Apal-lfindlll fragment of IMR32 1.80 
(Example II.A.2.b.), nucleotides 2672 to 4492 (SEQ ID No. 1) 
was isolated. The three DNA clones were ligated to form 
pVDCCIII/5' containing nt 1 to 510 (5' untranslated sequence; 
SEQ ID No. 1) and nt 511 to 4492 (SEQ ID No. 1) , 

b. pVDCCIII/S' .3 
Coii?)arison of the IMR32 1.36 and IMR32 1.80 DNA sequences 
revealed that these two cDNA clones differ through the 
coding sequence, nucleotides 2984 to 3212. nucleic acid 
amplification analysis of IMR32 1.80 and dbcAMP- induced 
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(1.0 mM, 10 days) IMR32 cytoplasmic RNA (isolated according to 
Ausubel, P.M. et al. (Eds) (1988) Current Protocols in 
Molecular Biology, John Wiley and Sons, New York) revealed 
that IMR32 1.80 had a 148 nt deletion, ut 2984 to 3131 (SEQ ID 
No. 1), and that IMR32 1.36 had a 132 nt deletion, nt 3081 to 
3212. To perform the nucleic acid amplification analysis, the 
amplification reaction was primed with a^o-specific 
oligonucleotides 112 (nt 2548 to 2572, SEQ ID No. l) and 311 
(the complementary sequence of nt 3928 to 3957, SEQ ID No. 1) . 
These products were then reamplified using CTio-specif ic 
oligonucleotides 310 (nt 2583 to 2608 SEQ ID No. 1) and 312 
(the complementary sequence of nt 3883 to 3909) . This 
reamplified product, which contains Accl and Bglll restriction 
sites, was digested with Accl and Bglll and the Accl-Bglll 
fragment, nt 2765 to 3890 (SEQ ID No. 1) was cloned into Accl- 
Bglll digested pVDCCIII/5' to replace the Accl-Bglll 
pVDCCIII/5' fragment that, had the deletion. This new 
construct was named pVDCCIII/5' .3 . DNA sequence determination 
of pVDCCIII/5' .3 through the amplified region confirmed the 
148 nt deletion in IMR32 1.80. 

c. pVDCCIII/3' .1 
To construct pVDCCIIl/3 ' . 1, the cDNA insert of IMR32 
1.163 (Example II.A.2.e.) was subcloned into pBluescript II 
(Stratagene, La Jolla, CA) as an Xhol fragment. The JChoI 
sites on the cDNA fragment were furnished by the adapters used 
to construct the cDNA library (Example I.E. 3.). The insert 
was oriented such that the translational orientation of the 
insert of IMR32 1.163 was c^posite to that of the lacZ gene 
present in the plasmid, as confirmed by analysis of 
restriction enzyme digests of the resulting plasmid. This was 
done to preclude the possibility of expression of a^p sequences 
in DH5ae cells transformed with this plasmid due to fusion with 
the lacZ gene. This plasmid was then digested with JJindlll 
and Bglll and the Hindlll - Bglll fragment (the Hindlll site 
comes from the vector and the Bgrill site is at nt 6220, SEQ ID 
No. 1) was eliminated, thus deleting nt 5200 to 6220 (SEQ ID 
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No. 1) of the IMR32 1.163 clone and removing this sequence 
from the remainder of the plasmid which contained the 3' Bglu 
- Xhol fragment. nt 6221 to 7635 (SEQ ID No. i) . 
pVDCCIIl/3'.l was then made by splicing together the J^indlli- 
Pvull fragment from 1MR32 1.80 (nucleotides 4493-5296, SBQ id 
No. 1), the Pvull - Bglil fragment of IMR32 1.163 (nucleotides 
5294 to 6220, SEQ ID No. 1) and the Hindi II -BglH-digested 
pBluescript plasmid containing the 3' Bglll/Xhol IMR32 1.163 
fragment (nt 6221 to 7635, SEQ ID No. 1) . 

d. pVDCCIlKA): the full-length a„ 
constzuct 

To construct pVDCCIII (A) , the i>raI-HindIII fragment (5' 
untranslated sequence nt 330 to 510, SEQ ID No. l and coding 
sequence nt 511 to 4492, SEQ ID No. 1) of pVDCCIll/s ' .3 
(Example II.A.3.b.) was isolated; the Hindlll-JOioI fragment of 
pVDCClii/3' .1 (containing nt 4493 to 7635, SEQ ID No. 1, plus 
the Xhol site of the adapter) (Example ll.A.3.c,) was 
isolated; and the plasmid vector, pcDNAl, was digested with 
BcoRV and Xhol and isolated on an agarose gel. The three DMA 
fragments were ligated and MC1061-P3 (Invitrogen, San Diego, 
CA) was transformed. Isolated clones were analyzed by 
restriction mapping and DNA sequencing and pVDCCIII (A) was 
identified which had the fragments correctly ligated together: 
r>raI-HindIII, Hindlll-Xhol, Xbol-EcdRV with the blunt-end Dral 
and EccXV site ligating together to form the circular plasmid. 

The amino- terminus of the a^^, subunit is encoded by the 
seven consecutive 5' methionine codons (nt 511 to 531, SEQ ID 
Mo. 1) . This 5' portion plus nt 532 to 537, 
encoding two lysine residues, were deleted from pVDCClli(A) 
and replaced with an efficient ribosomal binding site (5'- 
ACCACC-3 ' ) to form pVDCCIII . RBS (A) . Expression experiments in 
which transcripts of this construct were injected into Xenc^s 
laevis odcytes did not result in an enhancement in the 
recombinant voltage-dependent calcium channel expression level 
relative to the level of expression in oocytes injected with 
tremscripts of pVDCCIII (A) . 
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B, Isolation of DNA encoding the a^^ subunit 

1. Reference List of Partial a,c cDNA clones 

Numerous a^c-specif ic cDNA clones were isolated in order 
to characterize the ot,^ coding sequence, the a,^ initiation of 
translation, and an alternatively spliced region of a,^. SEQ 
ID No. 3 sets forth one a.^ coding sequence (a^^.i) and deduced 
amino acid sequence; SEQ ID No. 36 sets forth another splice 
variant designated a,^.,. SEQ ID No. 4 and No. 5 encode two 
possible amino terminal ends of an splice variant. SEQ ID 
No. 6 encodes an alternative exon for the IV S3 transmembrane 
domain. Other ot^^ variants can be constructed by selecting the 
alternative amino terminal ends in place of the ends in SEQ ID 
No. 3 or 36 and/or inserting the alternative exon (SEQ ID No. 
6) in the appropriate location, such as in SEQ ID NO. 3 in 
place of nucleotides 3904-3987. m addition, the 75 
nucleotide sequence (nucleotides 1391-1465 in SEQ ID No, 3) 
can be deleted or inserted to produce an alternative a^^ splice 
variant . 

Shown below is a list of clones used to characterize the 
sequence and the nucleotide position of each clone relative 
to the characterized a,c sequence (SEQ ID No. 3). The 
isolation and characterization of these cDNA clones are 
described below (Example II.B.2) . 



IMR32 

INR32 

IMR32 
*IMR32 
•l . 16G 
IMR32 
CNS 



IMR32 



1.66 

1.157 

1.67 
1.86 

1.37 
1.30 

1.38 



nt 1 to 916, SEQ ID No. 3 
nt 1 to 132, SEQ ID No. 4 
nt 1 to 873, SEQ ID No. 3 
nt 1 to 89, SEQ ID No. 5 
nt 50 to 1717, SEQ ID No. 3 
nt 1366 to 2583, SEQ ID No. 3 
nt 758 to 867, SEQ ID No. 3 
nt 2804 to 5904, SEQ ID No. 3 
nt 2199 to 3903, SEQ ID No. 3 
nt 1 to 84 of alternative exon, 

SEQ ID No. 6 
nt 2448 to 4702, SEQ ID No. 3 
nt 1 to 84 of alternative exon. 
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SEQ ID No. 6 

* IMR32 1.86 has a 73 nt deletion compared to the rabbit 
cardiac muscle calcium channel a, subunit cDNA sequence. 
"1.16G is an a^c genomic clone. 

2. Isolation and charaetsrizatlon of 
clones described In Kxanple ll.B.l. 
a. CHS 1.30 

Approximately l x 10« recombinants of the human thalamus 
CDNA library No. 6 (Example I.B.6.) were screened with 
fragments of the rabbit skeletal muscle calcium channel a 
CDNA described in Example II. A. 2. a. Hhe hybridization was 
performed using standard hybridization conditions (Example 
I.e.) and the filters were washed under low stringency 
(Example I.e.) . Six positive plaques were identified, one of 
which was CNS 1.30. CNS 1.30 was plaque purified, restriction 
mapped, subcloned, and characterized by dna sequencing. CNS 
1.30 encodes of,c-specific sequence nt 2199 to 3903 (SEQ ID No. 
3) followed by nt 1 to 84 of one of two identified alternative 
aac exons (SEQ ID No. 6). 3' of SEQ ID No. 6, OIS 1.30 
contains an intron and, thus, CNS 1.30 encodes a partially 
spliced otic transcript. 

b. 1.16a 

Approximately l x 10* recombinants of a XEMBL3 -based 
human genomic DNA library (Cat « HLlOOed Clontech Corp.. Palo 
Alto. CA) were screened using a rabbit skeletal muscle cDNA 
fragment (nt -78 to 1006, Example II.A.2.a.). The 
hybridization was performed using standard hybridization 
conditions {Exan?)le I.e.) and the filters were washed under 
low stringency (Example I.e. ) . Fourteen positive plaques were 
identified, one of which was 1.16G. Clone 1.16G was plaque 
purified, restriction mapped, subcloned, and portions were 
characterized by DNA sequencing. DNA sequencing revealed that 

I. 16G encodes ajc-specific sequence as described in Example 

II. B.i. 
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c. IMR32 1.66 and IMR32 1.67 

Approximately l x 10^ recombinants of IMR32 cDNA library 
#5 (Example I.B.5.) were screened with a 151 bp Kpnl-Sacl 
fragment of 1.16G (Example II.B.2*b.) encoding a^^ sequence (nt 
758 to 867, SEQ ID No. 3). The hybridization was performed 
using standard hybridization conditions (Example I.C.).. The 
filters were then washed in 0.5 x SSPE at 65*>C. of the 
positive plaques, IMR32 1.66 and IMR32 1.67 were identified. 
The hybridizing plaques were purified, restriction mapped, 
subcloned, and characterized by DNA sequencing. Two of these 
cDNA clones, IMR32 1.66 and 1.67, encode a^^ subunits as 
described (Example II.B.l.). m addition, IMR32 1.66 encodes 
a partially spliced a^^ transcript marked by a GT splice donor 
dinucleotide beginning at the nucleotide 3' of nt 916 (SEQ ID 
No. 3). The intron sequence within 1.66 is 101 nt long. 
INR32 1.66 encodes the a^^ initiation of translation, nt i to 
3 (SEQ ID No. 3) and 132 nt of 5' untranslated sequence (SEQ 
ID No. 4) precede the start codon in IMR32 1.66. 

d. ZMR32 1.37 and XMR32 1.38 
Approximately 2 x 10* recombinants of IMR32 cDNA library 
#1 (Example I.B.i.) were screened with the CNS 1.30 cDNA 
fragment (Example II.B.2.a.) . The hybridization was performed 
using low stringency hybridization conditions (Example I.e.) 
and the filters were washed under low stringency (Exaiiple 
I.e.). Four positive plaques were identified, plaque, 
purified, restriction mapped, subcloned, and characterized by 
DNA sequencing. Two of the clones, IMR32 1.37 and IMR32 1.38 
encode aic-specific sequences as described in Example II.B.l. 

DNA sequence comparison of IMR32 1.37 and IMR32 i.ss 
revealed that the a^c transcript includes two exons that encode 
the IVS3 transmembrane domain. IMR32 1.37 has a single exon, 
nt 3904 to 3987 (SEQ ID No. 3) and IMR32 1.38 appears to be 
anomalously spliced to contain both exons juxtaposed, nt 3904 
to 3987 (SEQ ID No. 3) followed by nt 1 to 84 (SEQ ID No. 6) . 
The alternative splice of the a^^ transcript to contain either 
of the two exons encoding the IVS3 region was confirmed by 
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con^aring the CNS 1.30 sequence to the IMR32 1.37 sequence. 
CNS 1.30 contains nt 1 to 84 {SEQ ID No. s) preceded by the 
identical sequence contained in IMR32 1.37 for nt 2199 to 3903 
{SBQ ID No. 3). As described in Bxanqple II.B.2.a., an intron 
follows nt 1 to 84 (SEQ ID No. 6) . Two alternative exons have 
been spliced adjacent to nt 3903 (SEQ ID No. 3) represented by 
CNS 1.30 and IMR32 1.37. 

e. IMR32 1.86 
IMR32 cDNA library #1 (Example I.B.I.) was screened in 
duplicate using oligonucleotide probes 90-9 (nt 1462 to 1491, 
SEQ ID No. 3) and 90-12 (nt 2496 to 2520, SEQ ID No. 3). 
These oligonucleotide probes were chosen in order to isolate 
a clone that encodes the a^^ subunit between the 3' end of 
IMR32 1.67 (nt 1717, SEQ ID No. 3) and the 5' end of CNS 1.30 
(nt 2199, SEQ ID No. 3), The hybridization conditions were 
standard hybridization conditions (Example I.e.) with the 
exception that the 50% deionized formamide was reduced to 20%. 
The filters were washed under low stringency (Example I.e.). 
Three positive plaques were identified one of which was IMR32 
1.86. IMR32 1,86 was plaque purified, subcloned, and 
characterized by restriction mapping and DMA sequencing. 
IMR32 1.86 encodes a^^ sec[uences as described in Example 
II-B.l. Characterization by DNA sequencing revealed that 
IMR32 1.86 contains a 73 nt deletion compared to the DNA 
encoding rabbit cardiac miuscle calcium channel subunit 
[Mikami et al. (1989) Nature 340:230], nt 2191 to 2263. These 
missing nucleotides correspond to nt 2176-2248 of SEQ ID No. 
3. Because the 5 '-end of CNS 1.30 overlaps the 3 '-end of 
IMR32 1.86, some of these missing nucleotides, i.e., nt 2205- 
2248 of SEQ ID No. 3, are accounted for by CNS 1.30. The 
remaining missing nucleotides of the 73 nucleotide deletion in 
IMR32 1.86 (i.e., nt 2176-2204 SEQ ID No. 3) were determined 
by nucleic acid amplification analysis of dbcAMP- induced IMR32 
cell RNA. The 73 nt deletion is a frame-shift mutation and, 
thus, needs to be corrected. The exact human sequence through 
this region, (which has been determined by the DNA sequence of 
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CNS 1.30 and nucleic acid ainplif ication analysis of IMR32 cell 
RNA) can be inserted into IMR32 1.86 by standard methods, 
e-gr., replacement of a restriction fragment or site-directed 
mutagenesis . 

f . IMR32 1.157 

One million recombinants of IMR32 cDNA library #4 

(Example I.B.4.) were screened with an Xbol-EcoRl fragment of 

IMR32 1.67 encoding a^^ nt 50 to 774 (SEQ ID No. 3), The 

hybridization was performed using standard hybridization 

conditions (Example I.e.) . The filters were washed under high 

stringency (Example I^C). One of the positive plaques 

identified was IMR32 1.157. This plaque was purified, the 

insert was restriction mapped and subcloned to a standard 

plasmid vector pGEM7Z (Promega, Madison, WI) . The DMA was 

characterized by sequencing. IMR32 1.157 appears to encodes 

an alternative 5' portion of the a^^ sequence beginning with 

nt 1 to 89 (SEQ ID No. 5) and followed by nt 1 to 873 (SEQ ID 

No . 3 ) . Analysis of the 1.66 and 1 . 157 5 ' sequence is 

described below (Example II. B. 3.). 

3- Characterization of the initiation of 
translation site 

Portions of the sequences of IMR32 1.157 (nt 57 to 89, 
SEQ ID No. 5; nt 1 to 67, SEQ ID No. 3), IMR32 1.66 (nt 100 to 
132, SEQ ID No. 4; nt 1 to 67, SEQ ID No. 3) , were compared to 
the rabbit lung CaCB-receptor cDNA sequence, nt -33 to 67 
[Biel et ai. (1990) FEES Lett. 269:409]. The human sequences 
are possible alternative 5' ends of the a^c transcript encoding 
the region of initiation of translation. IMR32 1.66 closely 
matches the CaCB receptor cDNA sequence and diverges from the 
CaCB receptor cDNA sequence in the 5' direction begiiming at 
nt 122 (SEQ ID No. 4) . The start codon identified in the CaCB 
receptor cDNA sequence is the same start codon used to 
describe the ot^c coding sequence, nt l to 3 (SEQ ID No. 3). 

The sequences of a^c splice variants, designated a^c-i smd 
aic-2 are set forth in SEQ ID NOs. 3 and 36. 
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C. Isolation of partial eDNA clones encoding the a 
subunit and construction of a full-length clone " 

A human basal ganglia cDNA library was screened with the 
rabbit skeletal muscle a, subunit cDNA fragments (see Example 
II. A. 2. a for description of fragments) under low stringency 
conditions. One of the hybridizing clones was used to screen 
an IMR32 cell cDNA library to obtain additional partial 
CDNA clones, which were in turn used to further screen a^ 
IMR32 cell CDNA library for additional partial cDNA clones. 
One of the partial IMR32 er„ clones was used to screen a human 
hippocanpus library to obtain a partial a„ clone encoding the 
3' end of the a„ coding sequence. The sequence of some of the 
regions of the partial cDNA clones was compared to the 
sequence of products of nucleic acid amplification analysis of 
IMR32 cell RNA to determine the accuracy of the cDNA 
sequences . 

Nucleic acid ainplif ication analysis analysis of imr32 
cell RNA and genomic DNA using oligonucleotide primers 
corresponding to sequences located 5' and 3' of the STOP codon 
of the DNA encoding the or,, subunit revealed an alternatively 
spliced ff„-encoding mRNA in IMR32 cells. This second mRNA 
product is the result of differential splicing of the o,, 
subunit transcript to include another exon that is not present 
in the mRNA corresponding to the other 3' o„ cDNA sequence 
that was initially isolated. To distinguish these splice 
variants of the stibunit, the subunit encoded by a DMA 
sequence corresponding to the form containing the additional 
exon is referred to as a„., (SEQ ID No. 7) , whereas the subunit 
encoded by a DNA sequence corresponding to the form lacking 
the additional exon is referred to as a„.2 (SEQ ID No. 8) . The 
sequence of o(„., diverges from that of a„., beginning at nt 
6633 (SEQ ID No. 7) . Following the sequence of the additional 
exon in Ois-i (nt 6633-6819; SEQ ID No. 7), the a„.i and 
sequences are identical (i.e., nt 6820-7362 in SEQ ID No.""? 
and nt 6633-7175 in SEQ ID No. 8) . SEQ ID No. 7 and No. 8 set 
forth 143 nt of 5' untranslated sequence (nt 1-143) as well as 
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202 nt Of 3' untranslated sequence (nt 7161-7362, SEQ ID No. 
7) of the DNA encoding a^.^ and 321 nt of 3' untranslated 
sequence (nt 6855-7175, SEQ ID No. 8) of the DNA encoding 

Nucleic acid ainplification analysis analysis of the IS6 
region of the a„ transcript revealed what appear to be 
additional splice variants based on multiple fragment sizes 
seen on an ethidium bromide -stained agarose gel containing the 
products of the amplification reaction. 

A full-length a^.^ cDNA clone designated pcDNA-a^..^ was 
prepared in an eight-step process as follows. 
STEP 1: The Sad restriction site of pGEM3 (Promega, 
Madison, WI) was destroyed by digestion at the Sad 
site, producing blunt ends by treatment with T4 DNA 
polymerase, and religaticai. The new vector was 
designated pGEMASac. 
STEP 2: Fragment 1 ( Hindi II /JCpnl ; nt 2337 to 4303 of SEQ ID 
No. 7) was ligated into Hindlll/Kpnl digested 
pGEMBASac to produce pai.i77HK. 
STEP 3: Fragment l has a 2 nucleotide deletion (nt 3852 and 
3853 of SEQ ID No. 7) . The deletion was repaired 
by inserting an aiii>lfied fragment (fragment 2) of 
IMR32 RNA into po(l.l77HK. Thus, fragment 2 
(J«arI/JCpnI; nt 3828 to 4303 of SEQ ID No. 7) was 
inserted into Narl/Kpnl digested pal.i77HK 
replacing the Narl/iqpaal portion of fragment i and 
producing pol.l77HK/PCR. 
STEP 4: Fragment 3 (J^/I^l; nt 4303 to 5663 of SEQ ID 
No. 7) was ligated into Kpnl digested pal.l77HK/PCR 
to produce palBS'K. 
STEP 5: Fragment 4 (ScoRI/Hindlll; EcoRI adaptor plus nt 1 
to 2337 of SEQ ID No. 7) and fragment 5 
(Hindlll/JChoI fragment of palBS'K; nt 2337 to 5446 
of SEQ ID No. 7) were ligated together into 
EcdRI/Xbol digested pcDNAi (Invitrogen. San Diego. 
CA) to produce palBS' . 
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STEP 6: Fragment 6 lEcoRl/EcoRl; EcoRl adapters on both 
ends plus nt 5749 to 7362 of SEQ ID No. 7) was 
ligated into EcoRl digested pBluescript II KS 
(Stratagene, La Jolla, CA) with the 5' end of the 
fragment proximal to the i^i site in the 
polyl inker to produce pal. 230. 
STEP 7: Fragment 7 iKpal/XboI: nt 4303 to 5446 of SEQ ID 
No. 7), and fragment 8 {Xhol/Cspl; nt 5446 to 6259 
of SEQ ID No. 7) were ligated into Fpnl/Cspl 
digested pQfl.230 (removes nt 5749 to 6259 of SEQ ID 
No. 7 that was encoded in pal. 230 and maintains nt 
6259 to 7362 of SEQ ID No. 7) to produce palB3' . 
STEP 8: Fragment 9 {Sphl/Xhol; nt 4993 to 5446 of SEQ ID 
No- 7) and fragment 10 (Xhol/Xbal of palB3'; nt 
5446 to 7319 of SEQ ID No. 7) were ligated into 
Sphl/Xbal digested palB5' (removes nt 4993 to 5446 
of SEQ ID No. 7 that were encoded in palB5' and 
maintains nt l to 4850 of SEQ ID No. 7) to produce 
pcDN7U)fiB_i . 

The resulting construct, pcDNAaiB.^, contains, in pCDNAl, 

a full-length coding region encoding a^.^ (nt 144-7362, SEQ ID 

No. 7), plus 5' untranslated sequence (nt 1-143, SEQ ID No. 7) 

and 3' untranslated sequence (nt 7161-7319, SEQ ID No. 7) 

under the transcriptional control of the CMV promoter. 

D. Isolation of DNA encoding human calcium channel a, 
subunits ^ 

!• Isolation of partial clones 

DNA clones encoding portions of human calcium channel a^^^ 
subunits were obtained by hybridization screening of human 
cerebellum cDNA libraries and nucleic acid amplification of 
hianan cerebellum RNA. Clones corresponding to the 3' end of 
the ttj^ coding sequence were isolated by screening 1 x 10* 
recombinants of a randomly primed cerebellum cDNA library 
(size-selected for inserts greater than 2,8 kb in length) 
imder low stringency conditions {6X SSPE, 5X Denhart's 
solution, 0.2% SDS, 200 fxg/ml sonicated herring sperm DNA, 
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42 »C) with oligonucleotide 704 containing nt 6190-6217 of the 
rat coding sequence [Starr et al. (1992) Proc. Natl. Acad. 
Sci. U.S.A. 8fl:5621-5625] . Washes were performed under low 
stringency conditions. Several clones that hybridized to the 
probe (clones o(l.251-al.259 and al.244) were purified and 
characterized by restriction enzyme mapping and una sequence 
analysis. At least two of the clones, al.244 and al.254, 
contained a translation termination codon. Although clones 
orl.244 and al.254 are different lengths, they both contain a 
sequence of nucleotides that corresponds to the extreme 3' end 
of the a^, transcript, i.e,, the two clones overlap. These two 
clones are identical in the region of overlap, except, clone 
al.244 contains a sequence of 5 and a sequence of 12 
nucleotides that eure not present in al.254. 

To obtain additional Ou^-encoding clones, 1 x 10* 
recombinants of a randomly primed cerebellum cDNA library 
(size-selected for inserts ranging from l.o to 2.8 kb in 
length) was screened for hybridization to three 
oligonucleotides : oligonucleotide 701 (containing nucleotides 
2288-2315 of the rat coding sequence) , oligonucleotide 702 
(containing nucleotides 3559-3585 of the rat a^^ coding 
sequence) and oligonucleotide 703 (containing nucleotides 
4798-4827 of the rat coding sequence) . Hybridization and 
washes were performed using the same conditions as used for 
the first screening with oligonucleotide 704, except that 
washes were conducted at 45«»C. Twenty clones (clones al.269- 
Ofl.288) hybridized to the probe. Several clones were plaque- 
purified and characterized by restriction enzyme mapping and 
DMA sequence analysis. One clone, al.279, contained a 
sequence of about 170 nucleotides that is not present in other 
clones corresponding to the same region of the coding 
sequence. This region may be present in other splice 
variants. None of the clones contained a translation 
intiation codon. 

To obtain clones corresponding to the 5' end of the human 
coding sequence, another cerebellum cDKA library was 
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prepared using oligonucleotide 720 (containing nucleotides 
2485-2510 of SEQ ID No. 22) to specifically prime first-strand 
cDNA synthesis. The library (B x 10^ recombinants) was 
screened for hybridization to three oligonucleotides: 
oligonucleotide 701. oligonucleotide 726 (containing 
nucleotides 2333-2360 of the rat coding sequence) and 
oligonucleotide 700 (containing nucleotides 767-796 of the rat 
o(„ coding sequence) under low stringency hybridization and 
washing conditions. Approximately 50 plaques hybridized to 
the probe. Hybridizing clones al.381-al.390 were plaque- 
purified and characterized by restriction enzyme maping and 
DMA sequence analysis. At least one of the clones, al.38l, 
contained a translation initiation codon. 

Alignment of the sequences of the purified clones 
revealed that the sequences overlapped to comprise the entire 
OoA coding sequence. However, not all the overlapping 
sequences of partial clones contained convenient enzyme 
restriction sites for use in ligating partial clones to 
construct a full-length of^^ coding sequence. To obtain DNA 
fragments containing convenient restriction enzyme sites that 
could be used in constructing a full-length DNA. cDNA was 
synthesized from RNA isolated from human cerebellum tissue and 
subjected to nucleic acid amplification. The oligonucleotides 
used as primers corresponded to human coding sequence 
located 5' and 3' of selected restriction enzyme sites. Thus, 
in the first amplification reaction, oligonucleotides 753 
(containing nucleotides 2368-2391 of SEQ ID No. 22) and 728 
(containing nucleotides 3179-3202 of SEQ ID No. 22) were used 
as the primer pair. To provide a sufficient amount of the 
desired DNA fragment, the product of this an^lif ication was 
rearaplified using oligonucleotides 753 and 754 (containing 
nucleotides 3112-3135 of SEQ ID No. 22 as the primer pair. 
The resulting product was 768 bp in length. In the second 
ainplification reaction, oligonucleotides 719 (containing 
nucleotides 4950-4975 of SEQ ID No. 22 and 752 (containing 
nucleotides 5647-5670 of SEQ ID No. 22) were used as the 
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primer pair. To provide a sufficient amount of the desired 
second DNA fragment, the product of this amplification was 
reamplif ied using oligonucleotides 756 (containing nucleotides 
5112-5135 of SEQ ID No. 22) and 752 as the primer pair. The 
resulting product was 559 bp in length. 

2. Construction of £ull-L«iigth a^^ coding sequences 
Portions of clone «l.381, the 768 -bp nucleic acid 
amplification product, clone ai.278, the 559-bp nucleic acid 
amplification product, and clone ai.244 were ligated at 
convenient restriction sites to generate a full-length 
coding sequence referred to as a^j^.^. 

Comparison of the results of sequence analysis of clones 
ofl.244 and 01.254 indicated that the primary transcript of the 
Of„ subunit gene is alternatively spliced to yield at least two 
variant mRNAs encoding different forms of the subunit. One 
form, ofu^.^, is encoded by the sequence shown in SEQ ID No. 22. 
The sequence encoding a second form, a^.,, differs from the a^, 
i-encoding sequence at the 3' end in that it lacks a s-nt 
sequence found in clone «1.244 (nucleotides 7035-7039 of SEQ 
ID No. 22). This deletion shifts the reading frame and 
introduces a translation termination codon resulting in an a^.^ 
coding sequence that encodes a shorter subunit than that 
encoded by the a^., splice variant. Consequently, a portion 
of the 3' end of the a^,.! coding sequence is actually 3' 
untranslated sequence in the a^.^ DNA. The complete sequence 
of of^^.j, which can be constructed by ligating portions of clone 
al.38l, the 768-bp nucleic acid amplification product, clone 
al.278, the 559-bp nucleic acid amplification product and 
clone Of 1.2 54, is set forth in SEQ ID No. 23. 

E. Isolation of DNA Encoding the Bj, Subunit 
DNA encoding a^^ subunits of the human calcium channel 
were isolated from human hippocampus libraries. The selected 
clones sequenced. DNA sequence analysis of DNA clones 
encoding the a„ subunit indicated that at least two 
alternatively spliced forms of the same subunit primary 
transcript are esqjressed. One form has the sequence set forth 
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in SEQ ID NO. 24 and was designated a,,., and the other was 
designated a,,.„ which has the sequence obtained by inserting 
a 57 base pair fragment between nucleotides 2405 and 2406 of 
SEQ ID No. 24. The resulting sequence is set forth in SEQ ID 
No. 25. 

The subunit designated has a calculated molecular 

weight of 254,836 and the subunit designated a„., has a 
calculated molecular weight of 257,348. a,,., has a 19 amino 
acid insertion (encoded by SEQ ID No. 25) relative to in 
the region that appears to be the cytoplasmic loop between 
transmembrane domains IIS6 and IIISl. 

BMMPLR 111: ISOIATIOM OP cim CLONKS BNCQDZNO TOE BUMaN 

VEoacmi, CALCIUM cmniEL subtmit 

A. isolation of partial cDHA clones encoding the B 
subunit and construction of a full-length clone 
encoding the stibunlt cj.oixe 

A human hippocampus cDNA library was screened with the 
rabbit skeletal muscle calcium channel subunit cDNA 

fragment (nt 441 to 1379) [for isolation and sequence of the 
rabbit skeletal muscle calcium channel subunit cDNA, see 
U.S. Patent Application Serial NO. 482,384 or Ruth et al. 
(1989) Science 245:iii5] using standard hybridization 
conditions (Example i.e.). a portion of one of the 
hybridizing clones was used to rescreen the hippocampus 
library to obtain additional cDNA clones. The cDNA inserts of 
hybridizing clones were characterized by restriction mapping 
and DNA sequencing and compared to the rabbit skeletal muscle 
calcium channel /S^ subunit cDNA sequence. 

Portions of the partial subunit cDNA clones were 
ligated to generate a full-length clone encoding the entire /S, 
subunit. SEQ ID No. 9 shows the j8, subunit coding sequence 
(nt 1-1434) as well as a portion of the 3' untranslated 
sequence (nt 1435-1546) . The deduced amino acid sequence is 
also provided in SEQ ID No. 9. in order to perform expression 
experiments, full-length /J, subunit cDNA clones were 
constructed as follows. 
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Step 1: DNA fragment i (-800 bp of 5' untranslated 
sequence plus nt 1-277 of SEQ ID No. 9) was ligated to dna 
fragment 2 (nt 277-1546 of SEQ ID No. 9 plus 448 bp of Intron 
sequence) and cloned into pGEM7Z. The resulting plasraid, p^i- 
1.18, contained a full-length 0, subunit clone that included 
a 44 8 -bp intron. 

Step 2: To replace the S' untranslated sequence of p/ffi- 
1.18 with a ribosome binding site, a double-stranded adapter 
was synthesized that contains an Ec6Rl site, sequence encoding 
a ribosome binding site (5' -ACCACC-3' ) and nt 1-25 of SEQ id 
No. 9. The adapter was ligated to Smal-digested p^l-i.ie, and 
the products of the ligation reaction were digested with 
EcdRI. 

step 3: The BcoRI fragment from step 2 containing the 
EcoRl adapter, efficient ribosome binding site and nt 1-1546 
of SEQ ID No. 9 plus intron sequence was cloned into a plasmid 
vector and designated p^l-l.iSRBS. The £coRI fragment of p/5i- 
1.18RBS was subcloned into £coRl- digested pcDNAl with the 
initiation codon proximal to CMV promoter to form 
pHBCaCHiffi.RBS (A) . 

Step 4: TO generate a full-length clone encoding the 0 
subunit lacking intron sequence, DNA fragment 3 (nt 69-1146 of 
SEQ ID No. 9 plus 448 bp of intron sequence followed by nt 
1147-1546 of SEQ ID No. 9). was subjected to site-directed 
mutagenesis to delete the intron sequence, thereby yielding 
. The EccRI-xhoI fragment of p/8l-i.i8RBS (containing of 
the ribosome binding site and nt 1-277 of SEQ ID No. 9) was 
ligated to the Xhol-£coRi fragment of p/?i(-) (containing of nt 
277-1546 of SEQ ID No. 9) and cloned into pcDNAl with the 
initiation of translation proximal to the CMV promoter. The 
resulting expression plasmid was designated pHBCaCH/5,^s (A) . 
B. Splice Variant /J^.j 

DNA sequence analysis of the DNA clones encoding the /?, 
subunit indicated that in the CNS at least two alternatively 
spliced forms of the same human fi, subunit primary transcript 
are expressed. One form is represented by the sequence shown 
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in SEQ ID No. 9 and is referred to as The sequences of 

^1., and the alternative form, /S^.,, diverge at nt 1334 (SEQ 
ID No. 9) . The coniplete /S,., sequence (nt 1-1851) , including 
3' untranslated sequence (nt 1795-1851), is set forth in SBQ 
ID No. 10. 

maiPUJ IV: ISOLATION OP cDNA CLONES ENCODING THE HDMAN 

NEDRONAL CALCITIM CHANNEL a^-subunlt 
A. Isolation of cDKA clonas 

The complete human neuronal coding sequence (nt 35- 
3310) plus a portion of the 5' untranslated sequence (nt l to 
34) as well as a portion of the 3' untranslated sequence (nt 
3311-3600) is set forth in SEQ ID No. ll. 

To isolate DNA encoding the human neuronal a, subunit, 
human a, genomic clones first were isolated by probing huma^ 
genomic Southern blots using a rabbit skeletal muscle calcium 
channel a, subunit cDNA fragment Int 43 to 272, Ellis et al. 
(19B8) Science 240:1661] . Human genomic DNA was digested with 
EcoRl. electrophoresed, blotted, and probed with the rabbit 
skeletal muscle probe using standard hybridization conditions 
(Example I.e.) and low stringency washing conditions (Example 
I.e.). Two restriction fragments were identified. 3.5 kb and 
3.0 kb. These BcoRI restriction fragments were cloned by 
preparing a Xgtll library containing human genomic EcoRl 
fragments ranging from 2.2 kb to 4.3 kb. The library was 
screened as described above using the rabbit a, probe, 
hybridizing clones were isolated and characterized by DNA 
sequencing. HGCaCHQr2.20 contained the 3.5 kb fragment and 
HGCaCHa2.9 contained the 3.0 kb fragment. 

Restriction mapping and DNA sequencing revealed that 
HGCaCHa2.20 contains an 82 bp exon (nt 130 to 211 of the human 
a, coding sequence, SEQ ID No. ll) on a 650 bp Pstl-JBbal 
restriction fragment and that H6CaCH«2.9 contains 105 1^ of an 
exon (nt 212 to 316 of the coding sequence, SEQ ID No. ll) on 
a 750 bp Xbal-Bglli restriction fragment. These restriction 
fragments were used to screen the human basal ganglia cDNA 
library (Example II.e.2.a.). HBCaCHa2.l was isolated (nt 29 
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to 1163, SEQ ID No. 11) and used to screen a human brain stem 
cDNA library (ATCC Accession No. 37432) obtained from the 
American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, MD. 20852. Two clones were isolated, HBCaCHQr2-5 
(nt 1 to 1162, SEQ ID No. 11) and HBCaCHa2.8 (nt 714 to 1562, 
SEQ ID No. 11, followed by 1600 nt of intervening sequence) . 
A 2400 bp fragment of HBCaCHof2.8 (beginning at nt 759 of SEQ 
ID No. 11 and ending at a Sinai site in the intron) was used to 
rescreen the brain stem library and to isolate HBCaCHa2.11 (nt 
879 to 3600, SEQ ID No. 11). Clones HBCaCHa2.5 and 
HBCaCHQf2.li overlap to encode an entire human brain ot^ 
protein. 

B. Construction of pBBCaCHcXsA 

To construct pHBCaCHofjA containing DNA encoding a full- 
length human calcium channel ©a subunit, an (EcoRI) -PvuII 
fragment of HBCaCHa2.5 (nt l to 1061, SEQ ID No. 11, EcoRI 
adapter, PvuII partial digest) and a Pvnll-Pstl fragment of 
HBCaCHQf2.11 (nt 1061 to 2424 SEQ ID No. 11; PvuII partial 
digest) were ligated into EcoRI -Ps tl -digested pIBl24 
(Stratagene, La Jolla, CA) . Subsequently, an (EcoRI) -PstI 
fragment (nt 1 to 2424 SEQ ID No. 11) was isolated and ligated 
to a Pstl-(EcoRl) fragment (nt 2424 to 3600 SEQ ID No. 11) of 
HBCaCHa2.11 in EcoRI -digested pIBI24 to produce DNA, HBCaCHa2, 
encoding a full-length human brain subunit. The 3600 bp 
jecoRI insert of HBCaCHa2 (nt 1 to 3600, SEQ ID No, 11) was 
subcloned into pcDNAl (pHBCaCHa2A) with the methionine 
initiating codon proximal to the CMV promoter. The 3600 bp 
EcoRI insert of HBCaCHa2 was also subcloned into pSV2dIIFR 
[Subramani et al. (1981). Mol. Cell. Biol. 1:854-864] which 
contains the SV40 early promoter, mouse dihydrofolate 
reductase (dhfr) gene, SV40 polyadenylation and splice sites 
and sequences required for maintenance of the vector in 
bacteria . 
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EXAMPIiB V. DIFFERENTIAL PROCESSING OF THE HUMAN fi. 

TRANSCRIPT AND THE HUMAN TRANSCRIPT 

A. Differential processing of the transcript 

Nucleic acid aitplif ication analysis of the hiiman p^ 
transcript present in skeletal muscle, aorta, hippocampus and 
basal ganglia, and HBK 293 cells revealed differential 
processing of the region corresponding to nt 615-781 of SEQ ID 
No, 9 in each of the tissues. Four different sequences that 
result in five different processed p^ transcripts through this 
region were identified. The p^ transcripts from the different 
tissues contained different combinations of the four 
sequences, except for one of the p^ transcripts expressed in 
HEK 293 cells {p^.^) which lacked all four sequences. 

None of the P^ transcripts contained each of the four 
sequences; however, for ease of reference, all four sequences 
are set forth end-to-end as a single long sequence in SEQ ID 
No. 12. The four sequences that are differentially processed 
are sequence l (nt 14-34 in SEQ ID No. 12) , sequence 2 (nt BB- 
SS in SEQ ID No. 12), sequence 3 (nt 56-190 in SEQ ID No. 12) 
and sequence 4 (nt 191-271 in SEQ ID No. 12) . The forms of 
the ^1 transcript that have been identified include: (l) a 
form that lacks sequence 1 called p^.^ (expressed in skeletal 
muscle) , (2) a form that lacks sequences 2 and 3 called jS^.a 
(e3q>ressed in CNS) , (3) a foirm that lacks sequences 1, 2 and 3 
called (expressed in aorta and HEK cells) and (4) a form 
that lacks sequences 1-4 called P^.^ (expressed in HEK cells) , 
Additionally, the P-^.^ and P^_^ contain a guanine nucleotide (nt 
13 in SEQ ID No. 12) that is absent in the p^_^ and p^,^ forms. 
The sequences of P^ splice variants are set forth in SEQ ID 
Nos. 9, 10 and 33-35. 

B. Differential processing of transcripts encoding the 
a, subunit. 

The complete human neuronal coding sequence (nt 35- 
3307) plus a portion of the 5' untranslated sequence (nt l to 
34) as well as a portion of the 3' untranslated sequence (nt 
3308-3600) is set forth as SEQ ID No. 11. 
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Nucleic acid amplification analysis of the human or, 
transcript present in skeletal muscle, aorta, and CNS revealed 
differential processing of the region corresponding to nt 
1595-1942 of SEQ ID No. 11 in each of the tissues. 

The analysis indicated that the primary transcript 
of the genomic DNA that includes the nucleotides corresponding 
to nt 1595-1942 also includes an additional sequence (SEQ ID 
No. 13: 5' CCTATTGGTGTAGGTATACCAACAATTAATTT 
AAGAAAAAGGAGACCCAATATCCAG 3') inserted between nt 1624 and 
1625 of SEQ ID NO. 11. Five alternatively spliced variant 
transcripts that differ in the presence or absence of one to 
three different portions of the region of the primary 
transcript that includes the region of nt 1595-1942 of SEQ id 
No. 11 plus SEQ ID No. 13 inserted between nt 1624 and 1625 
have been identified. The five ff,-encoding transcripts from 
the different tissues include different combinations of the 
three sequences, except for one of the a, transcripts 
«^re88ed in aorta which lacks all three sequences. None of 
the Of, transcripts contained each of the three sequences. The 
sequences of the three regions that are differentially 
processed are sequence 1 (SEQ ID No. 13), sequence 2 ( 5' 
AACCCCAAATCTCAG 3', which is nt 1625-1639 of SEQ ID No. 11), 
and sequence 3(5' CAAAAAAGGGCAAAATGAAGG 3', which is nt 
1908-1928 of SEQ ID No. 11). The five a, forms identified are 
(1) a form that lacks sequence 3 called (expressed in 
skeletal muscle) , (2) a form that lacks sequence 1 called a,, 
(expressed in CNS) , (3) a form that lacks sequences i and 2 
called a,, expressed in aorta), (4) a form that lacks 
sequences 1, 2 and 3 called a,, (expressed in aorta) and (5) 
a form that lacks sequences l and 3 called a,, (expressed in 
aorta) . 

The sequences of - a,, are set forth in SEQ. ID Nos. 
29 - 32, respectively. 
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EXAMPLE VI: ISOIATIOH OP DMA EWCODINO A CHLCIOm CHAMMEL v 

SUBONIT PROM A HDmN BBAIN eONA LIBRARY CHMINBL y 

A. Isolation of DMA encoding the y subunit 

Approximately 1 x 10^ recombinants from a Xgtll -based 
human hippocampus cDNA library (Clontech catalog #HL108Bb, 
Palo Alto, CA) were screened by hybridization to a 484 bp 
sequence of the rabbit skeletal muscle calcium channel y 
subunit CDNA (nucleotides 621-626 of the coding sequence plus 
438 nucleotides of 3 ' -untranslated sequence) contained in 
vector YJlO [Jay, S. et al, (1990). Science 24^:490-492]. 
Hybridization was performed using moderate stringency 
conditions (20% deionized formamide, 5x Denhardt's, 6 x BSPE, 
0.2% SDS, 20 fxg/ml herring sperm DNA, 42oc) and the filters 
were washed under low stringency (see Example I.e.) . A plaque 
that hybridized to this probe was purified and insert DNA was 
subcloned into pGEM7Z. This cDNA insert was designated 7I.4. 

B. Characterization of 7I.4 

Yl.4 was confirmed by DNA hybridization and characterized 
by DNA sequencing. The 1500 bp Sstl fragment of 7I.4 
hybridized to the rabbit skeletal muscle calcium channel y 
subunit cDNA 7JIO on a Southern blot. SEQ analysis of this 
fragment revealed that it contains of approximately 500 nt of 
human DNA sequence and -lOOO nt of Xgtll sequence (included 
due to apparent destruction of one of the EcoRI cloning sites 
in Xgtll) . The human DNA sequence contains of 129 nt of 
coding sequence followed immediately by a translational STOP 
codon and 3' untranslated sequence (SEQ ID No. 14) . 

To isolate the remaining 5' sequence of the human y 
subunit CDNA, human CNS cDNA libraries and/or preparations of 
raRNA from human CNS tissues can first be assayed by nucleic 
acid amplification analysis methods using oligonucleotide 
primers based on the y cDNA-specif ic sequence of 7I.4. 
Additional human neuronal 7 sxibunit -encoding DNA can be 
isolated from cDNA libraries that, based on the results of the 
nucleic acid amplification analysis assay, contain 7-specific 
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antplifiable cDNA. Alternatively, cDNA libraries can be 
constructed from mRNA preparations that, based on the results 
of the nucleic acid amplification analysis assays, contain 7- 
specific anplifiable transcripts. Such libraries are 
constructed by standard methods using oligo dT to prime first - 
strand cDNA synthesis from poly A* RNA (see Example I^B.). 
Alternatively, first*strand cDNA can be specified by priming 
first-strand cDNA synthesis with a y cDlC\-specif ic 
oligonucleotide, based on the humain DNA sequence in 71.4. A 
cDNA library can then be constructed based on this first- 
strand synthesis cuid screened with the y-specific portion of 
yl.4. 

EXAMPLE VII: ISOLATION OF CDNA CLONES ENCODING THE HDMAN 
NEURONAL Ca CHANNEL /S, SUBUNIT 

Isolation of DNA Encoding human calcitm channel fij 
subunits 

Sequencing of clones isolated as described in Example III 
revealed a clone encoding a human neuronal calcium channel ft 
subunit (designated ft^ see, SEQ ID No. 26) . An 
oligonucleotide based on the 5' end of this clone was used to 
prime a human hippocampus cDNA library. The library was 
screened with this ft clone under conditions of low to medium 
stringency (final wash 0.5 X SSPE, 50"" C). Several 
hybridizing clones were isolated and sequenced. Among these 
clones were those that encode ft^r P3D ^2e- Fo^t example, 
the sequence of ftc is set forth in SEQ ID NO. 37, and the 
sequeence of ft^ is set forth in SEQ ID No. 38. 

A randomly primed hippocanpus library was then screened 
using a combination of the clone encoding ft^ and a portion of 
the ft clone deposited under ATCC Accession No. 63048., 
Multiple hybridizing clones were isolated. Among these were 
clones designated P±Q1, 0102 and /?104. jSlOl appears to 
encodes the 5' end of a splice variant of ft, designated ft^. 
P1Q2 and 0104 encode portions of the 3' end of ft. 

It appears that the ft splice variants include 
nucleotides 182-2294 of SEQ ID No. 26 and differ only between 
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the start codon cOid nucleotides that correspond to 212 of SBQ. 
ID No. 26. 

BXAMPZiE VIII: ISOLATION OF cDNA CLONES ENCODING HUMAN 
CALCIUM CHANNEL ff^ and SUBtXNITS 

A. Isolation of cDNA Clones Encoding a Htiman fi^ Subunit 
A clone containing a translation initiation codon and 

approximately 60% of the 0^ coding sequence was obtained from 
a human cerebellum cDNA library (see nucleotides 1-894 of 
Sequence ID No. 27) . To obtain DNA encoding the remaining 3' 
portion of the coding sequence, a human cerebellum cDNA 
library was screened for hybridization a nucleic acid 
amplification product under high stringency hybridization and 
wash conditions. Hybridizing clones are purified and 
characterized by restriction enzyme mapping and DNA sequence 
analysis to identify those that contain sequence corresponding 
to the 3' end of the subunit coding sequence and a 
termination codon. Selected clones are ligated to the clone 
containing the 5' half of the 0^ coding sequence at convenient 
restriction sites to generate a full-length cDNA encoding a fi^ 
sxabunit. "The secpience of a full-length 0^ clone is set forth 
in SEQ ID No. 27; the amino acid sequence is set forth in SEQ 
ID No. 28. 

B. Isolation of cDNA Clones Encoding a Human 03 
Subunit 

Sequencing of clones isolated as described in Example III 
also revealed a clone encoding a human neuronal calcium 
channel ^3 subunit. This clone has been deposited as plasmid 
/31.42 (ATCC Accession No. 69048) . 

To isolate a full-length cDNA clone encoding a complete 
jSj subunit, a human hippocampus cDNA library (Stratagene, La 
Jolla, CA) was screened for hybridization to a 5' EcoRI-PstI 
fragment of the cDNA encoding using lower stringency 

hybridization conditions (20% deionized formamide, 200 fig/ral 
sonicated herring sperm DNA, 5X SSPE, 5X Denhardt's solution, 
42° C) cmd wash conditions. One of the hybridizing clones 
contained both trsmslation initiation and termination codons 
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and encodes a complete ^3 subunit designated 0^^^ (Sequence ID 
No. 19) . In vitro transcripts of the cDNA were prepared and 
injected into Xenopus oocytes along with transcripts of the 
ajB-i and a^^ cDNAs using methods similiar to those described in 
Example IX -D. Two-electrode voltage clamp recordings of the 
oocytes revealed significant voltage -dependent inward Ba^* 
currents. 

An additional /?3 subunit -encoding clone, designated ^83.2, 
was obtained by screening a human cerebellum cDNA library for 
hybridization to the nucleic acid amplification product 
referred to in Example VIII. A. under lower stringency (20% 
deionized formamide, 200 /xg/ml sonicated herring sperm DNA, 5X 
SSPE, 5X Denhardt's solution, 42° C) hybridization and wash 
conditions. The 5' ends of this clone (Sequence ID No. 20, /S3. 
2) and the first ^3 subxinit, designated P^.^, (Sequence ID No. 
19) differ at their 5' ends and are splice variants of the /S3 
gene. 

EXAMPLE XX: RECQMBINANT EXPRESSION OF HUMAN NEX7R0NAL 

CALCIUM CHANNEL SUBUNIT -ENCODING cDNA AND RNA 
TRANSCRIPTS IN MAMMALIAN CELLS 

A. Recombinant E3q)ro88ion of the Human 

Neuronal Calcium Chaxinal a, subunit cDNA 
in D644 Cells 

1. Stable transf action of DG44 cells 

D644 cells [dhfr' Chinese hamster ovary cells; see, e.g., 

Urlaub, G. et al. (1986) Sam. Cell Molec. Genet. 12:555-566] 

obtained from Lawrence Chasin at Columbia University were 

stably transfected by CaPO^ precipitation methods [Wigler et 

al. (1979) Proc, Natl. Acad. Sci. USA 7^:1373-1376] with 

pSV2dhfr vector containing the human neuronal calcium channel 

Ofa-subunit cDNA (see Exait^le IV) for polycistronic 

expression/selection in transfected cells. Transfectants were 

grown on 10% DMEM medium without hypoxanthine or thymidine in 

order to select cells that had incorporated the expression 

vector. Twelve transfectant cell lines were established as 

indicated by their ability to survive on this medium. 
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2. Analysis of or, subiinlt cDNA expression in 
transfected DG44 cells 

Total RNA was extracted according to the method of 

Bimboim [(1988) Nuc. Acids Res. 1^:1487-1497] from four of 

the DG44 cell lines that had been stably transfected with 

pSV2dhfr containing the human neuronal calcium channel ot^ 

siibunit cDNA. RNA (--15 fig per lane) was separated on a 1% 

agarose formaldehyde gel, transferred to nitrocellulose and 

hybridized to the random-primed human neuronal calcium channel 

Ofa CDNA (hybridization: 50% formamide, 5 x SSPE, 5 x 

Denhardt's, 42» C*; wash :0.2 x SSPE, 0.1% SDS, 65<> C). 

Northern blot analysis of total RNA from four of the DG44 cell 

lines that had been stably transfected with pSV2dhfr 

containing the human neuronal calcium channel subunit cDNA 

revealed that one of the four cell lines contained hybridizing 

mRNA the size expected for the transcript of the subunit 

CDNA (5000 nt based on the size of the cDNA) when grown in the 

presence of 10 mM sodium butyrate for two days. Butyrate 

nonspecifically induces transcription and is often used for 

inducing the SV40 early promoter [Gorman, C. and Howard, B. 

(1983) Nucleic Acids Res, 21:1631] . This cell line, 440^-9, 

also produced mRNA species smaller (several species) and 

larger (6800 nt) than the size expected for the transcript of 

the Qfj CDNA (5000 nt) that hybridized to the ofj cDNA-based 

probe. The 5000- and 6800 -nt transcripts produced by this 

transfectant should contain the entire subunit coding 

sequence and therefore should yield a full-length ttj siabunit 

protein. A weaJcly hybridizing 8000 -nucleotide transcript was 

present in imtransfected and transfected DG44 cells. 

Apparently, DG44 cells transcribe a calcium channel c^a subunit 

or similar gene at low levels. The level of expression of 

this endogenous subunit transcript did not appear to be 

affected by exposing the cells to butyrate before isolation of 

RNA for northern analysis. 

Total protein was extracted from three of the DG44 cell 

lines that had been stably transfected with pSV2dhfr 
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containing the human neuronal calcium channel subunit cDNA. 
Approximately 10'' cells were sonicated in 300 ^1 of a solution 
containing 50 mM HEPES, 1 mM EDTA, 1 mM PMSF. An equal volume 
of 2x loading dye [Laemmli, U.K. (1970) . Nature 227:680] was 
added to the samples and the protein was subjected to 
electrophoresis on an 8% polyacrylamide gel and then 
electrotransf erred to nitrocellulose. The nitrocellulose was 
incubated with polyclonal guinea pig antisera (1:200 dilution) 
directed against the rabbit skeletal muscle calcium chcuinel qtj 
subunit (obtained from K. Campbell, University of Iowa) 
followed by incubation with ["^I] -protein A. The blot was 
exposed to X-ray film at -70*> C. Reduced samples of protein 
from the transfected cells as well as from untransfected D644 
cells contained immunoreactive protein of the size expected 
for the a, sxibunit of the human neuronal calcium channel (130- 
150 kDa) . The level of this immunoreactive protein was higher 
in 44a2-9 cells that had been grown in the presence of 10 mM 
sodixim butyrate than in 44a2-9 cells that were grown in the 
absence of sodium butyrate. These data correlate well with 
those obtained in northern analyses of total RNA from 44a2-9 
and untransfected DG44 cells. Cell line 44^2-9 also produced 
a 110 kD immunoreactive protein that may be either a product 
of proteolytic degradation of the full-length subxinit or a 
product of translation of one of the shorter (<5000 nt) mRNAs 
produced in this cell line that hybridized to the ot^ subiinit 
cDNA probe* 

B. Expression of DNA encoding human 
neuronal calcium channel a^, cr, and fi^ 
subunitB in HEK cells 

Human embryonic kidney cells (HEK 293 cells) were 

transiently and stably transfected with human neuronal DKA 

encoding calcium channel subunits. Individual transfectants 

were analyzed electrophysiologically for the presence of 

voltage -activated barium currents and functional recombinant 

voltage -dependent calcium channels were. 
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1. Transfiaction of HER 253 cells 

Separate expression vectors containing DNA encoding human 
neuronal calcium channel aij>, and /S^ subunits, plasmids 
pVDCCIIl (A) , pHBCaCHajA, and pHBCaCHiff.^RBS (A) , respectively, 
were constructed as described in Examples II. A. 3, IV. B, and 
III . B . 3 . , respect i vely . These three vectors were used to 
transiently co-transfect HEK 293 cells. For stable 
transfection of HEK 293 cells, vector pHBCaCH/S^RBS (A) (Example 
III.B.3.) was used in place of pHBCaCHi8i.RBS (A) to introduce 
the DNA encoding the jSj subunit into the cells along with 
pVDCCIII(A) and pHBCaGHttjA, 

a. Trajuilent transfection 
Expression vectors pVDCCIll(A), pHBCaCHajA and 
pHBCaCH^i^RBS (A) were used in two sets of transient 
transfectidns of HEK 293 cells (ATCC Accession No. CRL1573) . 
In one transfection procedure, HEK 293 cells were transiently 
cotransfected with the subtinit cDNA expression plasmid, the 
subunit cDNA esqpression plasmid, the 15^ subunit cDNA 
expression plasmid and plasmid pCMV^gal {Clontech 
Laboratories, Palo Alto, CA) . Plasmid pCMV^gal contains the 
iacZ gene (encoding E. coli p-galactosidase) fused to the 
cytomegalovirus (CMV) promoter and was included in this 
transfection as a marker gene for monitoring the efficiency of 
transfection. In the other transfection procedure, HEK 293 
cells were transiently co-transfected with the ot^ subunit cDNA 
expression plasmid pVDCCIII(A) and pCMV^gal. In both 
transfections, 2-4 x 10* HEK 293 cells in a 10-cm tissue 
culture plate were transiently co-transfected with 5 ^g of 
each of the plasmids included in the esqperiment according to 
stcuidard CaPO^ precipitation transfection procediires (Wigler 
et al. (1979) Proc. Natl. Acad. Scl. USA 7<?:1373-1376) . The 
transfectants were analyzed for jS-galactosidase expression by 
direct staining of the product of a reaction involving p- 
galactosidase and the X-gal substrate [Jones, J.R. (1986) EMBO 
5:3133-3142] and by measurement of /3-galactosidase activity 
[Miller, J.H. (1972) Experiments in Molecular Genetics, pp. 
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352-355, Cold Spring Harbor Press]. To evaluate subunit CDNA 
expression in these transf ectants, the cells were analyzed for 
subunit transcript production (northern analysis), subunit 
protein production (immunoblot analysis of cell lysates) and 
functional calcium channel expression (electrophysiological 
analysis) . 

b. Stable transfiection 

HEK 293 cells were transfected using the calcium 

phosphate transfection procedure [Current Protocols in 

Molecular Biology, Vol. 1, Wiley Inter-Science, Supplement 14, 

Unit 9.1.1-9.1.9 (1990)] . Ten-cin plates, each containing one- 

to- two million HEK 293 cells, were transfected with i ml of 

DNA/calcium phosphate precipitate containing 5 /ig pVDCCIII (A) , 

5 iig pHBCaCHofjA, Sfig pmCaCap^S (A) , 5 fig pCMVBgal and 1 /ig 

pSV2neo (as a selectable marJcer) . After 10-20 days of growth 

in media containing 500 fig G418, colonies had formed and were 

isolated using cloning cylinders. 

2. Analysis of HEK 293 cells transiently 
transfected with DNA encoding human neuronal 
calcium channel subunits 

a. Analysis of A-galactosidase es^ression 
Transient transfectants were assayed for ^-galactosidase 

expression by jS-galactosidase activity assays (Miller, J.H., 
(1972) Experiments in Molecular Genetics, pp. 352-355, Cold 

Spring Harbor Press) of cell lysates (prepared as described in 

Example VII. A. 2) and staining of fixed cells (Jones, J.R. 
(1986) EMBO 5:3133-3142). The results of these assays 

indicated that approximately 30% of the HEK 293 cells had been 

transfected. 

b. Northern analysis 

PolyA+ RNA was isolated using the Invitrogen Fast Trak 
Kit (Invitrogen, San Diego, CA) from HEK 293 cells transiently 
transfected with DNA encoding each of the a^, and /S^ 

subunits and the lacZ gene or the ot^ sxibunit and the lacZ 
gene. The RNA was subjected to electrophoresis on an agarose 
gel and transferred to nitrocellulose. The nitrocellulose was 
then hybridized with one or more of the following radiolabeled 



-91- 



probes: the lacZ gene, human neuronal calcium channel a^^ 
subunit -encoding cDNA, human neuronal calcium channel 
subunit -encoding cDNA or human neuronal calcium channel 
subunit -encoding cDNA. Two transcripts that hybridized with 
the ofj subunit -encoding cDNA were detected in HEK 293 cells 
transfected with the DNA encoding the a^, a^, and jS^ siO^miits 
and the lacZ gene as well as in HEK 293 cells transfected with 
the ofi subunit cDNA and the lacZ gene. One mRKA species was 
the size expected for the transcript of the subunit cDNA 
(8000 nucleotides) . The second RNA species was smaller (4000 
nucleotides) than the size expected for this transcript- RNA 
of the size expected for the transcript of the lacZ gene was 
detected in cells trzuisfected with the a^, and subunit- 
encoding cDNA and the lacZ gene and in cells transfected with 
the subunit cDNA and the lacZ gene by hybridization to the 
lacZ gene sequence, 

RNA from cells transfected with the a^, 0I2 and subunit- 
encoding cDNA and the iacZ gene was also hybridized with the 
Qfj and subunit cDNA probes. Two mRNA species hybridized to 
the ofa subunit cDNA probe. One species was the size expected 
for the transcript of the a, subunit cDNA (4000 nucleotides) . 
The other species was larger (6000 nucleotides) than the 
expected size of this transcript- Multiple RNA species in the 
cells CO- transfected with ot^, Ofa and /Si subunit -encoding cDNA 
and the lacZ gene hybridized to the /S^ subunit cDNA probe- 
Multiple P subunit transcripts of varying sizes were produced 
since the P subunit cDNA expression vector contaixis two 
potential polyA^ addition sites. 

c. Electrophysiological analysis 

Individual transiently transfected HEK 293 cells were 
assayed for the presence of voltage -dependent barium currents 
using the whole-cell variant of the patch clamp technique 
[Hamill et al. (1981). Pflugers Arch. 391:85-100]. HEK 293 
cells trauisiently transfected with pOIV/Sgal only were assayed 
for baritun currents as a negative control in these 
experiments. The cells were placed in a bathing solution that 
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contained barium ions to serve as the current carrier. 
Choline chloride, instead of NaCl or KCl, was used as the 
major salt component of the bath solution to eliminate 
currents through sodium and potassium channels. The bathing 
solution contained 1 raM MgClj and was buffered at pH 7.3 with 
10 xm HEPES {pH adjusted with sodium or tetraethylannnonium 
hydroxide) . Patch pipettes were filled with a solution 
containing 135 mM CsCl, 1 iriM MgCl^, 10 mM glucose, 10 mM EGTA, 
4 mM ATP and 10 mM HEPES (pH adjusted to 7.3 with 
tetraethylammonium hydroxide) . Cesium and tetraethylammonium 
ions block most types of potassium channels. Pipettes were 
coated with Sylgard (Dow-Coming, Midland, MI) and had 
resistances of 1-4 megohm. Currents were measured through a 
500 megohm headstage resistor with the Axopatch IC (Axon 
Instruments, Poster City, CA) amplifier, interfaced with a 
Labmaster (Scientific Solutions, Solon, OH) data acquisition 
board in an IBM- compatible PC, PClan^) (Axon Instruments) was 
used to generate voltage commands and acquire data. Data were 
analyzed with pClaxiip or Quattro Professional (Borland 
International, Scotts Valley, CA) programs . 

To apply drugs, "puffer" pipettes positioned within 
several micrometers of the cell under study were used to apply 
solutions by pressure application. The drugs used for 
pharmacological characterization were dissolved in a solution 
identical to the bathing solution. Samples of a lo mM stock 
solution of Bay K 8644 (RBI, Natick, MA), which was prepared 
in DNSO, were diluted to a final concentration of i |£M in 15 
mM Ba^*- containing bath solution before they were applied. 

Twenty-one negative control HEK 293 cells (transiently 
transfected with the JacZ gene expression vector pCMV^gal 
only) were analyzed by the whole-cell variant of the patch 
clamp method for recording currents. Only one cell displayed 
a discemable inward barium current; this current was not 
affected by the presence of 1 /iM Bay K 8644. In addition, 
application of Bay K 8644 to four cells that did not display 
Ba^* currents did not result in the appearance of any currents . 
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Two days after treoisient transfection of HEK 293 cells 
with Qfi, 0I2 and fi^ subunit- encoding cDNA and the lacZ gene, 
individual transfectants were assayed for voltage -dependent 
bariiim currents. The currents in nine trauisfectants were 
recorded. Because the efficiency of transfection of one cell 
can vary from the efficiency of transfection of another cell, 
the degree of expression of heterologous proteins in 
individual transfectants varies and some cells do not 
incorporate or express the foreign DNA. Inward barium 
currents were detected in two of these nine transfectants. In 
these assays, the holding potential of the membrane was -90 
mV. The membrane was depolarized in a series of voltage steps 
to different test potentials and the current in the presence 
and absence of l /tM Bay K 8644 was recorded. The inward 
barium current was significantly enhanced in magnitude by the 
addition of Bay K 8644. The largest inward barium current 
(-160 pA) was recorded when the membrane was depolarized to 0 
mV in the presence of 1 fM Bay K 8644. A comparison of the 
I-V curves, generated by plotting the largest current recorded 
after each depolarization versus the depolarization voltage, 
corresponding to recordings conducted in the ed>sence and 
presence of Bay K 8644 illustrated the enhancement of the 
voltage-activated current in the presence of Bay K 8644. 

Pronounced tail currents were detected in the tracings 
of currents generated in the presence of Bay K 8644 in HEK 293 
cells transfected with a^, of, and /S^ subunit -encoding cDNA and 
the iacZ gene, indicating that the recombinemt calcium 
channels responsible for the voltage-activated barium currents 
recorded in this transfected appear to be DHP- sensitive. 

The second of the two transfected cells that displayed 
inward barium currents expressed a -50 pA current when the 
membrane was depolarized from -90 mV. This current was nearly 
conpletely blocked by 200 fiM cadmium, an established calcium 
channel blocker. 

Ten cells that were transiently transfected with the DNA 
encoding the ot^ subunit and the lacZ gene were analyzed by 



whole-cell patch clamp methods two days after transfection. 
One of these cells displayed a 30 pA inward barium current. 
This current amplified 2-fold in the presence of 1 /zM Bay K 
8644. Furthermore, small tail currents were detected in the 
presence of Bay K 8644. These data indicate that expression 
of the human neuronal calcium channel subtinit -encoding cDNA 
in HEK 293 yields a functional DHP-sensitive calcium channel. 

3. Analysis of HEK 293 cells stably transfected 
with DMA encoding human neuronal calcium 
channel subxmits 

Individual stably transfected HEK 293 cells were assayed 

electrophysiologically for the presence of voltage -dependent 

barium currents as described for electrophysiological analysis 

of transiently transfected HEK 293 cells (see Example 

VII. B. 2 -c) . In an effort to maximize calcium chsumel activity 

via cyclic-AMP-dependent kinase-mediated phosphorylation 

[Pelzer, et al. (1990) Rev. Physiol. Biochem. Pbazmacol. 

114:107-207], cAMP (Na salt, 250 /iM) was added to the pipet 

solution and forskolin (10 /iM) was added to the bath solution 

in some of the recordings. Qualitatively similar results were 

obtained whether these confounds were present or not. 

Barium currents were recorded from stably transfected 

cells in the absence and presence of Bay K 8644 (1 /iM) . When 

the cell was depolarized to -10 mV from a holding potential of 

-90 mV in the absence of Bay K 8644, a current of 

approximately 35pA with a rapidly deactivating tail current 

was recorded. During application of Bay K 8644. an identical 

depolarizing protocol elicited a current of approximately 75 

pA, accompanied by an augmented and prolonged tail cvirrent. 

The peak magnitude of currents recorded from this same cell as 

a function of a series of depolarizing voltages were assessed. 

The responses in the presence of Bay K 8644 not only 

increased, but the entire current-voltage relation shifted 

about -10 mV. Thus, three typical hallmarks of Bay K 8644 

action, namely increased current magnitude, prolonged tail 

cxirrents, and negatively shifted activation voltage, were 
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observed, cleaurly indicating the expression of a DHP- sensitive 

calcium channel in these stably transfected cells. No such 

effects of Bay K 8644 were observed in untransfected HEK 293 

cells, either with or without cAMP or forskolin. 

Use of pCMV-based vectors and pcDNAl-based vectors 
for e3q)re88ion of DKA encoding human neuronal 
calcium channel subtmits 

1. Preparation of constructs 

Additional esqpression vectors were constructed using 
pCMV. The full-length cDNA from pVDCCIII(A) (see E3caii?)le 

II. A.B.d), the full-length of, cDNA, contained on a 3600 bp 
JScoRI fragment from HBCaCHaa (see Example IV. B) and a full- 
length subunit cDNA from pHBCaCH^^i^S (A) (see Example 

III . B . 3 ) were separately subcloned into plasmid pCMV/9gal . 
Plasmid pCMVjSgal was digested with Notl to remove the lacZ 
gene. The remaining vector portion of the plasmid, referred 
to as pCMV, was blunt-ended at the Notl sites. The full- 
length Ofj-encoding DNA and /S^ -encoding DNA, contained on 
separate EcdRl fragments, were isolated, blunt-ended and 
separately ligated to the blunt -ended vector fragment of pCMV 
locating the cDNAs between the CMV promoter and SV40 
polyadenylation sites in pCMV. To ligate the ajQ-encoding cDNA 
with pCMV, the restriction sites in the polylinkers 
immediately 5' of the CMV promoter and immediately 3' of the 
SV40 polyadenylation site were removed from pCMV. A 
polyl inker was added at the Notl site. The polylinker had the 
following sequence of restriction enzyme recognition sites: 
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GGCCGC 
CG 
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EcoRl 
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GT 

CACCGG 
t 

Destroys JViot 



The ofip-encoding DNA, isolated as a BaMll/Xhol fragment from 
pVDCCIII(A), was then ligated to -Xball/Sall-digested pCMV to 
place it between the CMV promoter and SV40 polyadenylation 
site. 

Plasmid pCMV contains the CMV promoter as does pcDNAl, 
but differs from pcDNAl in the location of splice donor/splice 
acceptor sites relative to the inserted subunit -encoding DNA. 
After inserting the subunit- encoding DNA into pCMV, the splice 
donor/splice acceptor sites are located 3' of the CMV promoter 
and 5' of the subunit -encoding DNA start codon. After 
inserting the subunit -encoding DNA into pcDNAl, the splice 
donor/splice acceptor sites are located 3' of the siabxmit cDNA 
stop codon. 

2. Transf action of HEK 293 cells 

HEK 293 cells were . transiently co-trcuisf ected with the 

Of ofa and fi^ subunit -encoding DNA in pCMV or with the a„>, a, 

and P subunit -encoding DNA in pcDNAl (vectors pVDCCIII (A) , 

pHBCaCHaaA and pHBCaCH^ij^S (A) , respectively) , as described in 

Example Vll.B.l.a. Plasmid pCMV^gal was included in each 

trcuisfection as a measure of transf ection efficiency. The 

results of /8-galactosidase assays of the transf ectants (see 

Example VII.B.2.), indicated that HEK 293 cells were 

transf ected equally efficiently with pCMV- and pcDNAl -based 

plasmids. The pcDNAl -based plasmids, however, are presently 

preferred for expression of calcium channel receptors. 

D. Expression in Xenopus laevis oocytes of RNA 
encoding human neuronal calcium channel subunits 

Various combinations of the transcripts of DNA encoding 

the humem neuronal a^, ai2 Pi subunits prepared in vitro 

were injected into Xenopus laevis odcytes. Those injected 

with combinations that included a^ exhibited voltage-activated 

barium currents. 
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1. Preparation of transcriptfi 

Transcripts encoding the human neuronal calcium channel 
Qfjo, ofa and /Sj subunits were synthesized according to the 
instructions of the mCAP raRNA CAPPING KIT (Strategene, La 
Jolla, CA catalog #200350). Plasmids pVDCC III.RBS(A), 
containing pcDNAl and the a^j, cDNA that begins with a ribosome 
binding site and the eighth ATG codon of the coding sequence 
(see Example III,A.3.d), plasmid pHBCaCHtt^A containing pcDNAl 
and an subunit cDNA (see Example IV) , and plasmid 
pHBCaCHiSibRBS (A) containing pcDNAl and the DNA lacking 
intron sequence and containing a ribosome binding site (see 
Example III), were linearized by restriction digestion. The 
ofiD cDNA- and of^ subunit -encoding plasmids were digested with 
Xhol, and the /?i subunit- encoding plasmid was digested with 
BcoRV. The DNA insert was transcribed with T7 RNA polymerase. 

2. Injection of o5cytes 

Xenopus laevis oocytes were isolated and def olliculated 
by collagenase treatment and maintained in 100 mM NaCl, 2 mM 
KCl, 1.8 mM CaCla, 1 mM MgClj, 5 mM HEPES, pH 1.6, 20 fig/ml 
anpicillin and 25 /ig/ml streptomycin at 19-25*>C for 2 to 5 
days after injection and prior to recording. For each 
transcript that was injected into the o6cyte, 6 ng of the 
specific mRNA was injected per cell in a total volume of 50 
nl, 

3. Intracellular voltage recordings 

Injected odcytes were examined for voltage -dependent 
barium currents using two-electrode voltage clamp methods 
[Dascal, N- (1987) CRC Crit. Rev. Biochem. 22:317] . The 
pClamp (Axon Instrtiments) software package was used in 
conjunction with a Labmaster 125 kHz data acquisition 
interface to generate voltage commands and to acquire and 
cuialyze data. Quattro Professional was also used in this 
analysis. Current signals were digitized at 1-5 kHz, and 
filtered appropriately. The bath solution contained of the 
following: 40 mM BaClj, 36 mM tetraethyl ammonium chloride 
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(TEA-Cl), 2 mM KCl, 5 tnM 4-aminopyridine, 0.15 mM niflumic 
acid, 5 mM HEPES, pH 7.6. 

a. Electrophysiological analysis of oocytes 
Injected with transcripts encoding the 
human neuronal calcium channel a^, a, and 
/7x*8ubunits 

Uninj acted oocytes were examined by two-electrode voltage 
clamp methods and a very small (25 nA) endogenous inwcurd Ba^^ 
current was detected in only one of seven analyzed cells. 

Oocytes coinjected with ot^, ofj and subunit transcripts 
expressed sustained inward barium currents upon depolarization 
of the membrane from a holding potential of -90 mV or -50 mV 
(154 ± 129 nA, ns21) . These currents typically showed little 
inactivation when test pulses ranging from 140 to 700 msec, 
were administered. Depolarization to a series of voltages 
revealed currents that first appeared at approximately -30 mV 
euid peaked at approximately 0 mV, 

implication of the DHP Bay K 8644 increased the magnitude 
of the currents, prolonged the tail currents present upon 
repolarization of the cell and induced a hyperpol arizing shift 
in current activation. Bay K 8644 was prepared fresh from a 
stock solution in DMSO and introduced as a lOx concentrate 
directly into the 60 fil bath while the perfusion pump was 
turned off. The DMSO concentration of the final diluted drug 
solutions in contact with the cell never exceeded 0.1%. 
Control experiments showed that 0.1% DMSO had no effect on 
membrane currents. 

implication of the DHP antagonist nifedipine (stock 
solution prepared in DMSO and applied to the cell as described 
for application of Bay K 8644) blocked a substeuitial fraction 
(91 ± 6%, n=7) of the inward barium current in oocytes 
coinjected with transcripts of the ot^j^, 0I2 and jS^ subunits. A 
residual inactivating component of the inward barium current 
typically remained after nifedipine application. The inward 
bariiim current was blocked completely by 50 Cd'*, but only 
approximately 15% by 100 iM Ni^*. 
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The effect of cjCgTX on the inward barium currents in 

oocytes co- injected with transcripts of the a^o, a^, and 

subunits was investigated, wCgTX (Bachem, Inc., Torrance CA) 

was prepared in the 15 mM BaClj bath solution plus 0.1% 

cytochrome C (Sigma) to serve as a carrier protein. Control 

e^qperiments showed that cytochrome C had no effect on 

currents. A series of voltage pulses from a -90 mV holding 

potential to 0 mV were recorded at 20 msec, intervals. To 

reduce the inhibition of wCgTX binding by divalent cations, 

recordings were made in 15 mM BaClj, 73.5 mM 

tetraethyl ammonium chloride, and the remaining ingredients 

identical to the 40 mM Ba^^ recording solution. Bay K 8644 was 

applied to the cell prior to addition to 6)CgTX in order to 

determine the effect of oiCgTX on the DHP-sensitive current 

coiziponent that was distinguished by the prolonged tail 

currents. The inward barium current was blocked weakly (54 ± 

29%, n=7) and reversibly by relatively high concentrations 

(10-15 fM) of Ci)CgTX. The test ciirrents and the acconpanying 

tail ctirrents were blocked progressively within two to three 

minutes after application of uCgTX, but both recovered 

partially as the (oCgTX was flushed from the bath. 

b. Analysis of oocytes injected with only 
a transcripts encoding the human 
neuronal caldxim channel or 
transcripts encoding an a„ and other 
subunits 

The contribution of the a, suid /ff^ subunits to the inward 
barium current in oocytes injected with transcripts encoding 
the Qfip, ttj and subunits was assessed by expression of the 
subunit alone or in combination with either the subtinit 
or the ofa subunit. In oocytes injected with only the 
transcript of a a^^ cDNA, no Ba^* currents were detected (n=3 ) . 
In oocytes injected with transcripts of ofu, and jS^ cDNAs, small 
(108 ± 39 nA) Ba^* currents were detected upon depolarization 
of the membrane from a holding potential of -90 mV that 
resembled the currents observed in cells injected with 
transcripts of a^, and cDNAs, although the magnitude of 
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the current was less. In two of the four o6cytes injected 

with trcuiscripts of the ofu,- encoding and jSj-encoding DNA, the 

Ba^* currents exhibited a sensitivity to Bay K 8644 that was 

similar to the Bay K B644 sensitivity of Ba^* currents 

expressed in oocytes injected with transcripts encoding the cr„ 

of^-, and /3x subunits. 

Three of five o5cytes injected with trcUiscripts encoding 

the and ofa subunits exhibited very small Ba^* currents (15- 

30 nA) upon depolarization of the membrane from a holding 

potential of -90 mv. These barium currents showed little or 

no response to Bay K 8644. 

c. Analysis of oocytes injected with 
transcripts encoding the human neuronal 
calcium channel and/or subunit 

To evaluate the contribution of the Qr^- subunit to the 
inward barium currents detected in o&cytes co- injected with 
transcripts encoding the cr^o, 03 and subimits, o6cytes 
injected with transcripts encoding the human neuronal calcium 
channel ofj and/or svibimits were assayed for barium currents. 
Oocytes injected with transcripts encoding the ofa subunit 
displayed no detectable inward barium currents (n=5) . 
Oocytes injected with transcripts encoding a subunit 
displayed measurable (54 ± 23 nA, n^S) inward bcurium currents 
upon depolarization and oocytes injected with transcripts 
encoding the or, and /^^ subunits displayed inward barium 
currents that were approximately 50% larger (80 ± 61 nA, n=18) 
than those detected in oocytes injected with transcripts of 
the /Sj- encoding DNA only. 

The inward barium ctirrents in o6cytes injected with 
transcripts encoding the subunit or and subunits 
typically were first observed when the membrane was 
depolarized to -30 mV from a holding potential of -90 mV and 
peaUced when the membrane was depolarized to 10 to 20 mV. 
Macroscopically, the currents in oocytes injected with 
transcripts encoding the of, and subunits or with transcripts 
encoding the P^ sub\init were indistinguishable. In contrast 
to the currents in oocytes co- injected with transcripts of a^u, 
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0^2 and subunit cDNAs, these currents showed a significant 
inactivation during the test pulse and a strong sensitivity to 
the holding potential. The inward barium currents in oocytes 
co-injected with transcripts encoding the Qfj and subunits 
usually inactivated to 10-60% of the peak magnitude during a 
140-msec pulse and were significantly more sensitive to 
holding potential than those in oocytes co- injected with 
transcripts encoding the a^^, and /ff, subunits. Changing the 
holding potential of the membranes of oocytes co- injected with 
transcripts encoding the and jS, subunits from -90 to -50 mV 
resulted in an approximately 81% {n=ll) reduction in the 
magnitude of the inward barium current of these cells. In 
contrast, the inward barium current measured in odcytes co- 
injected with transcripts encoding the or^ and fi^ subunits 
were reduced approximately 24% (n=ll) when the holding 
potential was changed from -90 to -50 mV. 

The inward barium currents detected in oocytes injected 
with transcripts encoding the ot^ and subunits were 

pharmacologically distinct from those observed in oocytes co- 
injected with transcripts encoding the a^,, ot^ and subunits. 
OScytes injected with transcripts encoding the aa and fi^ 
subunits displayed inward barium currents that were 
insensitive to Bay K 8644 (n=ll) . Nifedipine sensitivity was 
difficult to measure because of the holding potential 
sensitivity of nifedipine and the current observed in o6cytes 
injected with transcripts encoding the a, and jSi subunits. 
Nevertheless, two o6cytes that were co-injected with 
transcripts encoding the and /S^ subunits displayed 

measurable (25 to 45 nA) inward barium currents when 
depolarized from a holding potential of -50 mV, These 
currents were insensitive to nifedipine (5 to 10 fM) . The 
inward barium currents in oocytes injected with transcripts 
encoding the ofj and sub\mits showed the same sensitivity to 
heavy metals as the currents detected in oocytes injected with 
transcripts encoding the a^,, and p^ subunits. 
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The inward barium current detected in oocytes injected 

with transcripts encoding the human neuronal and jS^ subunits 

has pharmacological and biophysical properties that resemble 

calcium currents in uninjected Xenopus oocytes. Because the 

amino acids of this human neuronal calcium channel subunit 

lack hydrophobic segments capable of forming transmembrane 

domains, it is unlikely that recombinant /S^ subunits alone can 

form an ion channel. It is more probable that a homologous 

endogenous ot^ subunit exists in oocytes and that the activity 

mediated by such an subunit is enhanced by expression of a 

human neuronal fi^ subimit. 

B. Baq»reBsion of UNA encoding human neuronal calcium 
channel a^, a^^ and ^^.2 subunits In HER cells 

1. Transfection of HEK cells 

The transient expression of the human neuronal aiB-i, 
and /Sj.a subunits was studied in HEK293 cells. The HEK293 
cells were grown as a monolayer culture in Dulbecco's modified 
Eagle's medium (Gibco) containing 5% def ined-supplemented 
bovine calf serum (Hyclone) plus penicillin G (100 U/ml) and 
steptomycin sulfate (100 /*g/ml) . HEK293 cell transfections 
were mediated by calcium phosphate as described above. 
Transfected cells were examined for inward Ba^* currents (Jg.) 
mediated by voltage -dependent Ca^* channels . 

Cells were transfected (2 x 10* per polylysine- coated 

plate. Standard transfections (10-cm dish) contained 8 ^g 

of pcDNAaiB.1, 5 iig of pHBCaCHa^A, 2 /ig pHBCaCHiffibRBS (A) (see. 

Examples II. A. 3, IV. B. and III) and 2 ^g of CMVfi (Clontech) p- 

glactosidase expression plasmid, and pUClB to maintain a 

constant mass of 20 ^g/ml. Cells were analyzed 48 to 72 hours 

after transfection. Transfection efficiencies (±10%), which 

were determined by in situ histochemical staining for 

/3-galactosidase activity (Sanes et al. (1986) EMBO J., 

5:3133), generally were greater than 50%. 

2. Electrophysiological analysis of transfectant 
currents 

a. Materials and methods 
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Properties of recombinantly expressed Ca^* channels were 
studied by whole cell patch-clamp techniques. Recordings were 
performed on transfected HEK293 cells 2 to 3 days after 
transfection. Cells were plated at 100,000 to 300,000 cells 
per polylysine-coated, 35-iran tissue culture dishes (Falcon, 
Qxnard, CA) 24 homrs before recordings- Cells were perfused 
with 15 mM BaCl^, 125 mM choline chloride, 1 mM MgClj, and 
10 mM Hepes (pH « 7.3) adjusted with tetraethylammonium 
hydroxide (bath solution) , Pipettes were filled with 135 mM 
CsCl, 10 mM EGTA, 10 mM Hepes, 4 mM Mg-adenosine triphosphate 
(pH = 7-5) adjusted with tetraethylammonixjm hydroxide. 
Sylgard (Dow-Coming, Midland, MI) -coated, fire-polished, and 
filled pipettes had resistances of 1 to 2 megohm before gigohm 
seals were established to cells. 

Bay K 8644 and nifedipine (Research Biochemicals, Natick, 
MA) were prepared from stock solutions (in dimethyl sulfoxide) 
and diluted into the bath solution. The dimethyl sulfoxide 
concentration in the final drug solutions in contact with the 
cells never exceeded 0.1%. Control eaqoeriments showed that 
0.1% dimethyl sulfoxide had no efect on membrane currents. 
(jCgTX (Bachem, Inc., Torrance CA) was prepared in th& 15 mM 
BaCla bath solution plus 0.1% cytochrome C (Sigma, St. Louis 
MO) to serve as a carrier protein. Control experiments 
showed that cytochrome C had no effect on currents. These 
drugs were dissolved in bath solution, and continuously 
applied by means of puffer pipettes as required for a given 
experiment. Recordings were performed at room temperature 
(22** to 25®C) . Series resistance compensation (70 to 85%) was 
employed to minimize voltage error that resulted from pipette 
access resistance, typically 2 to 3.5 megohm. Current signals 
were filtered (-3 dB, 4-pole Bessel) at a frequency of 1/4 to 
1/5 the sampling rate, which ranged from 0.5 to 3 kHz. 
Voltage commands were generated and data were acquired with 
CIiAMPEX (pClamp, Axon Instrximents , Foster City, CA) . All 
reported data are corrected for linear leak and capacitive 
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components. Exponential fitting of currents was performed 
with CLAMPFIT (Axon Instruments, Foster City, CA) . 
b. Results 

Transfectants were examined for inward Ba^* currents 
( Jb«) Cells cotraiisfected with DNA encoding a^^.i, ot^^,, and ^j., 
subuhlts expressed high-voltage-activated Ca^^ channels. 
first appeared when the membraner was depolarized from a 
holding potential of -90 mV to -20 mV and peaked in magnitude 
at 10 mV. Thirty-nine of 95 cells (12 independent 
transfections) had I^^ that ranged from 30 to 2700 pA, with a 
mean of 433 pA. The mean current density was 26 pA/pF, auid 
the highest density was 150 pA/pF. The Jb. typically increased 
by 2- to 20- fold during the first 5 minutes of recording. 
Repeated depolarizations during long records often revealed 
rimdown of J^a usually not exceeding 20% within 10 min. 
typically activated within 10 ms and inactivated with both a 
fast time constant ranging from 46 to 105 ms and a slow time 
constant ranging from 291 to 453 ms (n = 3) . Inactivation 
showed a complex voltage dependence, such that elicited at 
^20 mV inactivated more slowly than elicited at lower test 
voltages, possibly a result of an increase in the magnitude of 
slow compared to fast inactivation components at higher test 
voltages . 

Recombinant ot^.xCt^^.^ channels were sensitive to holding 
potential. Steady- state inactivation of l^, measured after 
a 30- to 60-s conditioning at various holding potentials, was 
approximately 50% at holding potential between -60 and -70 mV 
and approximately 90% at -40 mV. Recovery of Jb. from 
inactivation was usually incomplete, measuring 55 to 75% of 
the original magnitude within 1 min. after the holding 
potential was returned to more negative potentials, possibly 
indicating some rundown or a slow recovery rate. 

Recombinant criB-i<>'2bi^i-2 channels were also blocked 
irreversibly by ct>-CgTx concentrations ranging from 0.5 to 
10 iM during the time scale of the experiments. Application 
of 5 /xM toxin (n = 7) blocked the activity completely within 
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2 min . , and no recovery of was observed after washing 
w-CgTx from the bath for up to 15 min. d^* blockage (50 fiM) 
was rapid, complete, and reversible; the DHPs Bay K 8644 
(1 f^M; n = 4) or nifedipine {5 fiM; n = 3) had no discernable 
effect. 

Cells cotransfected with DNA encoding ^jb, and /ff^., 

subunits predominantly displayed a single class of saturable, 
high-affinity co-CgTx binding sites. The determined 

dissociation constant (IQ) value was 54.6 ± 14.5 pM (n = 4) . 
Cells transfected with the vector containing only 
^-galactosidase-encoding DNA or a^^jS- encoding DNA showed no 
specific binddLng. The binding capacity (B^) of the 
0fiB.ia2b/5-transfected cells was 28,710 ± 11,950 sites per cell 
(n = 4) . 

These results demonstrate that a^e.^a^b/S^.j- transfected 

cells express high-voltage-activated, inactivating Ca^* channel 

activity that is irreversibly blocked by cj-CgTx, insensitive 

to DHPs, and sensitive to holding potential. The activation 

and inactivation kinetics and voltage sensitivity of the 

channel formed in these cells are generally consistent with 

previous characterizations of neuronal N-type Ca^* channels. 

F. Ba^reasion of DNA encoding human neuronal calcium 
channel a^.^, ^2b' Pi-2 eubunits in BBK 

cells 

Significant Ba^* currents were not detected in 
vmtransfected HEK293 cells. Furthermore, untransf ected HEK293 
cells do not express detectable w-CgTx GVIA binding sites. 

In order to approximate the expression of a homogeneous 
population of trimeric ot^^, ofab and &^ protein complexes in 
transfected HEK293 cells, the a^^, ofsb and fi^ es^ression levels 
were altered. The efficiency of expression and assembly of 
channel complexes at the cell surface were optimized by 
adjusting the molar ratio of a^, and fi^ expression plasmids 
used in the transf ections . The transf ectants were analyzed 
for mRNA levels, w-CgTx GVIA binding and Ca^* channel current 
density in order to determine near optimal chaimel expression 
in the absence of imntunological reagents for evaluating 
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protein expression. Higher molar ratios of a^t appeared to 
increase calcium channel activity. 
1. Transfections 
HEK293 cells were maintained in DMEM {Gibco #320-1965AJ) , 
5.5% Z^fined/Supplemented bovine calf serum (Hyclone #A-2151*' 
L) , 100 U/ml penicillin G and 100 fig/ml streptomycin. Ca'*- 
phosphate based trauisient transfections were performed and 
analyzed as described above. Cells were co-transfected with 
either 8 /xg pcDNAlOfiB-i (described in Example II. C), 5 ng 
pHBCaCMofjA (see, Exait^le IV.BJ, 2 /ig pHBCaCH^^i^S (A) {fi^,^ 
expression plasmid; see Examples III. A. and IX.E.), and 2 fig 
pCMVi3-gal [Clontech, Palo Alto, CA] (2:1*8:1 molar ratio of 
Ca^"^ cheumel subunlt expression plasmids) or with 3 fig 
pcDNAla„.i or pcDNAlofia-af 11.25 fig pHBCaGHffaA, 0.75 or llO ^g 
pHBCaCH^ij^S (A) or pcDNAl/J^.^ and 2 fig pCMVj?-gal (2:10.9:1 
molar ratio of Ca^* channel subunit expression plasmids) . 
Plasmid pCMV/S-gal, a £-galactosidase e:q)ression plasmid, was 
included in the transfections as a mcurker to permit 
transfection efficiency estimates by histochemical staining. 
When less than three subunits were expressed, pCMVPL2, a pCMV 
promoter- containing vector that lacks a cDNA insert, was 
substituted to maintain equal moles of pCMV-based DNA in the 
transfection. pUC18 DNA was used to maintain the total mass 
of DNA in the transfection at 20 /xg/plate. 

RNA from the transfected cells was analyzed by Northezn 
blot analysis for calcium channel subunit mRMA egression 
using random primed ''P-labeled subunit specific probes. 
HEK293 cells co-transfected with aiB-i/ ^2h and ^^.j egression 
plasmids (8, 5 and 2 fig, respectively; molar ratio - 2:1.8:1) 
did not express equivalent levels of each Ca^* channel subunit 
mRNA. Relatively high levels of ot^^,j and /S^.j mRNAs were 
expressed, but significantly lower levels of mRNA were 
expressed. Based on autoradiograph exposures required to 
produce equivalent signals for all three mRNAs, of^i, transcript 
levels were estimated to be 5 to 10 times lower than a„.i and 
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/S^.j transcript levels. Untransf ected HEK293 cells did not 
express detectable levels of ot^B-^, a2b/ c>r /Sj.j niRNAs. 

To achieve equivalent Ca^* channel subiinit mRNA expression 
levels, a series of transfections was performed with various 
amoxints of ©ib-h suid /J^.j eaqpression plasmids. Because the 
ajB-i and fii.a iriRNAs were expressed at very high levels compared 
to ffjb mRNA, the mass of OfiB-i and fi^.j plasmids was lowered and 
the mass of ct2b plasmid was increased in the transfection 
experiments. Co- transfection with 3, 11.25 and 0.75 fig of a^B. 
1, ^21, and /3i.2 expression plasmids, respectively (molar ratio 
= 2:10.9:1), approached ecjuivalent expression levels of each 
Ca^* channel subunit mRNA, The relative molar quantity of 
e3q>ression plasmid to Oi^.i and fi^.a expression plasmids was 
increased 6 -fold. The mass of ttiB-i and jS^.a plasmids in the 
transfection was decreased 2. 67 -fold and the mass of 
plasmid was increased 2.25-fold. The 6-fold molar increase of 
ffab relative to a^B^i and fi^.a required to achieve near equal 
abundance mRNA levels is consistent with the previous 5- to 
10-fold lower estimate of relative ofab mRNA abundance. w-CgTx 
GVIA binding to cells transfected with various amounts of 
expression plasmids indicated that the 3, 11.25 and 0.75 fig of 
ttiB-w <^2h and /8i.a plasmids, respectively, improved the level of 
cell surface expression of channel complexes, Fiarther 
increases in the mass of ofab and Si.a expression plasmids while 
c^iB-i ^as held constamt, and alterations in the mass of the ot^^.i 
expression plasmid while ajb and fii.j were held constant, 
indicated that the cell surface expression of w-CgTx GVIA 
binding sites per cell was nearly optimal. All subsequent 
transfections were performed with 3, 11.25 and 0.75 /xg or 1.0 
fig of ofiB-i or a„-2. «2b and ^^-2 or /S1.3 expression plasmids, 
respectively. 

2, ^''x-(i9-CgTac GVIA binding to transfected cells 
Statistical analysis of the and values was 

performed using one-way analysis of variance (ANOVA) followed 
by the Tukey- Kramer test for multiple pairwise comparisons 
(pssO.05)- 
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Conibinations of human voltage-dependent Ca^* channel 
stibunits, ttiB-i/ ^18-2 1 ^2hi and /Si^/ were analyzed for 

saturation binding of "^I-w-CgTx GVIA. About 200,000 cells 
were used per assay, except for the ot^^.^, ci^h-2i 0fiB.i0f2b and a^^. 
jWab combinations which were assayed with l x 10* cells per tube 
The transfected cells displayed a single-class of saturable, 
high-affinity binding sites. The values for the dissociation 
constants (K^) and binding capacities (B„^) were determined for 
the different combinations. The results are summarized as 
follows : 

Subunit Conbination (pM) (sites /cell) 

OfiB-i«2b/3i-2 54.9 ± 11.1 {n=4) 45,324 ± 15,606 

aiB.ia2b/?i.3 53.2 ± 3-6 (n=3) 91,004 ± 37,654 

ttiB-i/3i.2 17.9 ± 1.9 (n=3) 5,756 ± 2,163 

«iB-ii8i-3 17.9 ± 1.6 (n=3) 8,729 ± 2,980 

otxB-^Ott^ 84 .IS ± 15.3 (n=3) 2,256 ± 356 

aiB-i 31.7 ± 4.2 (n=3) 757 ± 128 

aiB-aa2bft.2 53.0 ± 4.8 {n=3) 19,371 ± 3,798 

«iB-20f2bA-3 44-3 ± 8.1 (n=3) 37,652 ± 8,129 

«i8-2i8i-2 16.4 ± 1.2 (n=3) 2,126 ± 412 

ttiB-a/'i-a 22.2 ± 5,8 {n=3) 2,944 ± 1,168 

OfiB.^ajb N.D.' {n«3) N.D, 

OfiB.2 N.D. N.D. 

* N.D. « not detectable 

Cells transfected with subunit combinations lacking 

either the aia-i or the a^B., subunit did not exhibit any 

detectable "^I-w-CgTx GVIA binding (s 600 sites/cell) . "^^I-w- 

CgTx GVIA binding to HEK293 cells transfected with ol^^^2 alone 

or cL^^.2'^2b ^as too low for reliable Scatchard analysis of the 

data. Comparison of the and B^^ values revealed several 

relationships between specific combinations of subunits and 

the binding affinities and capacities of the transfected 

cells. In cells transfected with all three subunits, (ot^^, 

ia2bl5i-2-r aiB-ia2bi8i-3-r aiB.2Qfabi8i-2-i or aiB-2a2b/Si.3-transfectants) the 

values were indistinguishable (p>0.05), ranging from 44.3 
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is, 1 pM to 54.9 + 11.1 pM. In cells transfected with two- 
subunit combinations lacking the sxabunit toia-iiSi.j* ^ib-i/^io/ 
QfiB-2/?i.2 or o^iB.2/Si-3) the Kd values were significantly lower than 
the three-subunit combinations (p<0.01), rauiging from 16.4 ± 
1.2 to 22.2 ± 5.8 pM. Cells transfected with only the qt^b-i 
s\ib\mit had a Ka value of 31.7 ± 4.2 pM, a value that was not 
different from the two-subunit combinations lacking a^^ 
(p<0.05). As with the comparison between the four otj^^otzhPi 
versus otib/Si combinations, when the a^B-i was co-expressed with 
Qfjj,, the increased significantly (p<0.05) from 31.7 ± 4.2 to 
84.6 ± 5.3 pM. These data demonstrate that co-expression of 
the ofab subunit with otj^,^, ^iB-iiSi-a. a^.^^xo' aiB-2^i.2 or a^n.2P^^^ 
subunit combinations results in lower binding affinity of the 
cell surface receptors for "^I-w-CgTx GVIA. Ihe values 

of cells transfected with various subunit combinations also 
differed considereUoly. Cells transfected with the ofiB-i sxibunit 
alone expressed a low but detectable number of binding sites 
(approximately 750 binding sites/cell) . when the a^a.! subunit 
was co-expressed with the cyjb subuinit, the binding capacity 
increased approximately three-fold while co-e3q>ression of a 0^, 
2 or ^1.3 subunit with a^B-i resulted in 8- to 10 -fold higher 
expression of surface binding. Cells transfected with all 
three subunits expressed the highest number of cell surface 
receptors. The binding capacities of cells transfected with 
0fiB-i^2b^i-3 or a^-20f2bi8i.3 combinations were approximately two- fold 
higher than the corresponding combinations containing the jS^.a 
soibunit. Likewise, cells transfected with ofiB-iffai^i.a or 
<*iB-ittab/Si-3 combinations expressed approximately 2-5-fold more 
binding sites per cell than the corresponding combinations con- 
taining axB-2' cases, co-expression of the 
subunit with and /5i increased the surface receptor density 
compared to cells transfected with only the corresponding a^^ and 
j5i combinations; approximately 8 -fold for 0fiB-ia2bi'i-2 * 10 -fold for 
aiB.i«2bi5i.3f 9" fold for aiB~2a2biSi-2# and 13 -fold for QfiB.2ff2b/5i-3 • Thus, 
comparison of the values suggests that the toxin-binding 
subvinit, ttiB-i or is more efficiently expressed and 



-110- 



assembled on the cell surface when co-ex-pressed with either the 
(Vjb or the p^.^ or /S^.a subimit, and most efficiently expressed 
when and snbiinits are present. 
3 • Blectrophysiology 

Functional e3q>ression of of^B-i^2b^i-2 <^ib-i^i-2 subunit 

conibinations was evaluated using the whole-cell recording 
technique. Transfected cells that had no contacts with 
surrounding cells and simple morphology were used approximately 
48 hours after transfection for recording. The pipette 
solution was (in mM) 135 CsCl, 10 EGTA, 1 MgCl^, 10 HEPES, and 
4 itiM Mg-ATP (pH 7.3, adjusted with TEA-OH) . The external 
solution was (in mM) 15 BaClj, 125 Choline Cl, 1 MgClj, and 10 
HEPES (pH 7.3, adjusted with TEA-OH) « o-CgTX 6VIA (Bachem) was 
prepared in the external solution with 0.1% cytochrome C 
(Sigma) to serve as a carrier. Control experiments showed that 
cytochrome C had no effect on the Ba^* current. 

The macroscopic electrophysiological properties of Ba^* 
currents in cells transfected with various amounts of the 
expression plasmid with the relative amounts of a^^.^ and /S^.^ 
plasmids held constant were examined. The anplitudes and 
densities of the Ba^'^ currents (15 mN BaCl,) recorded from whole 
cells of these transf ectants differed dramatically. The 
average currents from 7 to 11 cells of three types of 
transfections (no ofji,; 2:1.8:1 [of^B-i :«2b = /5i-2] molar ratio; and 
2:10.9:1 [^ib-i sofabZ /S^.j] molar ratio) were determined. The 
smallest currents (range: 10 to 205 pA) were recorded when 
was not included in the transfection, and the largest currents 
(range: 50 to 8300 pA) were recorded with the 2:10.9:1 ratio of 
o^iB-iQf2bi^i-2 plasmids, the ratio that resulted in near equivalent 
mRNA levels for each sxibunit transcript. When the amount of 
plasmid was adjusted to yield approximately an equal abundance 
of subunit mRNAs, the average peak Ba^* current increased from 
433 pA to 1,824 pA (4.2-fold) with a corresponding increase in 
average current density from 26 pA/pP to 127 pA/pF (4.9-fold). 
This increase is in the presence of a 2.7-fold decrease in the 
mass of axB.i and ^^.j expression plasmids in the transfections. 
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In all transfections, the magnitudes of the Ba^* currents did 
not follow a normal distribution. 

To compare the subunit combinations and determine the 
effects of a^, the current -voltage properties of cells 
transfected with a„.ifii-2 or with a^^.^oc^^Z^.:, in either the 2:1.8:1 
(aiB.i:aab:^i.2) molar ratio or the 2:10.9:1 ict^B^i za^i^ip^.^) molar 
ratio transfectants were examined. The extreme examples of no 
Q!2b and 11.25 /zg a^^, (2:10.9:1 molar ratio) showed no significant 
differences in the current voltage plot at test potentials 
between 0 mV and +40 mV {p<0.05). The slight differences 
observed at either side of the peak region of the current 
voltage plot were likely due to normalization. The very small 
currents observed in the a„.ii5i.j transfected cells have a 
substantially higher component of residual leak relative to the 
barium current that is activated by the test pulse. When the 
current voltage plots are normalized, this leak is a much 
greater conponent than in the ot^^.jOt2hl^i-2 transfected cells and 
as a result, the current -voltage plot appears broader. This is 
the most likely explanation of the apparent differences in the 
current voltage plots, especially given the fact that the 
current -voltage plot for the ot^.:,P^,2 transfected cells diverge 
on both sides of the peak. Typically, when the voltage- 
dependence activation is shifted, the entire ciirr en t -voltage 
plot is shifted, which was not observed. To qualitatively 
compare the kinetics of each, the average responses of test 
pulses from -90 mV to 10 mV were normalized and plotted. No 
significant differences in activation or inactivation kinetics 
of whole-cell Ba'* currents were observed with any combination. 

6. Expression of DNA encoding human neuronal calcium 
cl^aiittel "ii-attai/^i-a «d ofi,.ia„/5i.3 subunits in HEK cells 

Functional expression of the a^^.^ot^nPu^ and a^E-iOt2BPi.3 , as 
well as was evaluated using the whole cell recording 

technique . 

1. Methods 
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Recordings were performed on transiently transfected HEK 
293 cells two days following the transfection, from cells that 
had no contacts with surrounding cells and which had simple 
morphology. 

The internal solution used to fill pipettes for recording 
the barium ciirrent from the transfected recombincoit calcium 
channels was (in mM) 135 CsCl, 10 E6TA, 1 MgCl,, 10 HEPES, and 

4 mM Mg-ATP (pH 7.4-7.5, adjusted with TEA-OH) . The external 
solution for recording the barium current was (in mM) 15 BaCl,, 
150 Choline Cl, 1 MgCl^, and 10 HEPES and 5 TEA-OH (pH 7.3, 
adjusted with TEA-OH) . In experiments in which Ca^* was 
replaced for Ba^", a Laminar flow chamber was used in order to 
completely exchange the extracellular solution and prevent any 
mixing of Ba^* and Ca**. w-CgTx GVIA was prepared in the 
external solution with 0.1% cytochrome C to serve as a carrier, 
the toxin was applied by pressurized puffer pipette. Series 
resistance was compensated 70-85% and currents were analyzed 
only if the voltage error from series resistcince was less than 

5 mV- Leak resistance and capacitance was corrected by 
siibtractlng the scaled current observed with the P/-4 protocol 
as iirplemented by pClamp (Axon Instruments) . 

2. Electrophyslology Results 

Cells trauisf ected with 0fiB-ia2bi8i-3 or Ofig.aOfajJS^.j showed strong 
barium currents with whole cell patch clamp recordings. Cells 
expressing Qf^E-sOfiBpi-a had larger peak currents than those 
expressing ot^s-i^jifii'^ - In addition, the kinetics of activation 
and inactivation are clearly substsmtially faster in the cells 
expressing a^^ calcium channels. HEK 293 cells expressing a^^.y 
alone have a significant degree of functional calcium channels, 
with properties similar to those eaqpressing ot^t!^2hPi-y but with 
substantially smaller peak barium currents. Thus, with a^^, the 
02 and /Si subunits are not required for functional expression of 
ofiE mediated calcium channels, but do substantially increase the 
number of functional calcium chcumels. 

Examination of the current voltage properties of aiBOfabi^io 
expressing cells indicates that ofiB.jajbiSi-a is a high-voltage 
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activated calcium channel and the peak current is reached at a 
potential only slightly less positive than other neuronal 
calcium channels also es^ressing a^i, and p^, and a^^^ and a^. 
Current voltage properties of QfiB.i0f2j,^i.3 and aiE^aabiSi-j are 
statistically different from those of aiB-i«2bi5i.3 . Current 
voltage curves for a^.^aab^^.j and a^.^ci^bPi-s peak at approximately 
+5mV, as does the current voltage curve for a^^,^ alone. 

The kinetics and voltage dependence of inactivation using 
both prepulse (200 ms) and steady-state inactivation was 
examined, a^^ mediated calcium channels are rapidly inactivated 
relative to previously cloned calcium channels and other high 
voltage-activated calcium channels, QfiB-sttab/^i-s mediated calcium 
channels are inactivated rapidly and are thus sensitive to 
relatively brief (200 ms) prepulses as well as long prepulses 
(>20s steady state inactivation) , but recover rapidly from 
steady state inactivation. The kinetics of the rapid 
inactivation has two components, one with a time constant of 
approximately 25 ms and the other approximately 400 ms. 

To determine whether a^^ mediated calcium channels have 
properties of low voltage activated calcium channels, the 
details of tail currents activated by a test pulse ranging -60 
to +90 mV were measured at -60 mV. Tail cxurrents recorded at - 
60 mV could be well fit by a single exponential of 150 to 300 
fis; at least an order of magnitude faster than those typically 
observed with low voltage- activated calcium channels, 

HEK 293 cells expressing (x^z-2^2hfii-:i flm more current with 
Ba^* as the charge carrier and currents carried by Ba'* and Ca'* 
have different current -voltage properties. Furthermore, the 
time course of inactivation is slower and the amount of 
prepulse inactivation less with Ca^* as the charge carrier. 

While the invention has been described with some 
specificity, modifications apparent to those with ordinary 
skill in the art may be made without departing from the scope 
of the invention. Since such modifications will be apparent to 
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those of skill in the art, it is intended that this invention 
be limited only by the scope of the appended claims. 
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(2) INFORMATION FOR SEQ ID NO:l: 
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(B) TYPE: nucleic acid 
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(A) KANE/KEY: 3'UTR 

(B) LOCATION: 6994.. 7635 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GGGC6A6CGC CTCCGTCCCC GGATGTGA6C TCCGGCTGCC CGCGGTCCC6 A0CCA6CGGC 60 

6CGCGG6CG6 CG6CGX3CG66 CACC6G6CAC C6C6GCGGGC GGGCAGACGG GCGG6CAT6G 120 

G6GGAGCGCC GAGCG6CCCC GGCG6CCGG6 CCG6CATCAC C6CGGCGTCT CTCCGCTAGA 180 

GGAGG66ACA A6CCAGTTCT CCTTTGCA6C AAAAAATTAC ATGTATATAT TATTAAGATA 240 

ATATATACAT TGGATTTTAT TTTTTTAAAA AGTTTATTTT GCTCCATTTT TGAAAAAGAG 300 

AGA6CTTGGG T6GCGAGCG6 TTTTTTTTTA AAATCAATTA TCCTTATTTT CTGTTATTT6 360 

TCCCC6TCCC TCCCCACCCC CCT6CTGAA6 CGAGAATAA6 GGCAGGGACC 6CG6CTCCTA 420 

CCTCTTGGTG ATCCCCTTCC CCATTCCGCC CCCGCCCCAA CGCCCASCAC AOTGCCCTGC 480 

ACACA6TAGT CGCTCAATAA ATGTTCGTG6 AT6 ATG ATG AT6 ATG AT6 ATG AAA 534 

Net Met Met Met Met Met Met Lys 
1 5 

AAA ATG CAG CAT CAA CGG CAG CAG CAA GCG GAG CAC GCG AAC GAG GCA 582 
Lys Met Gin His Gin Arg Gin Gin Gin Ala Asp His Ala Asn Glu Ala 
10 15 20 

AAC TAT GCA AGA GGC ACC AGA CTT CCT CTT TCT GGT GAA GGA CCA ACT 630 
Asn Tyr Ala Arg Gly Tbr Arg Leu Pro Leu Ser Gly Glu Gly Pro Thr 
25 30 35 40 

TCT CAG CCG AAT AGC TCC AAG CAA ACT GTC CTG TCT TGG CAA GCT GCA 678 
Ser Gin Pro Asn Ser Ser Lys Gin Thr Val Leu Ser Trp Gin Ala Ala 
45 50 55 

ATC GAT GCT GCT AGA CAG GCC AAG GCT GCC CAA ACT ATG AGC ACC TCT 726 
lie Asp Ala Ala Arg Gin Ala Lys Ala Ala Gin Thr Net Ser Thr Ser 
60 65 70 

GCA CCC CCA CCT GTA GGA TCT CTC TCC CAA AGA AAA CGT CAG CAA TAC 774 
Ala Pro Pro Pro Val Gly Ser Leu Ser Gin Arg Lys Arg Oln Gin Tyr 
75 80 85 

GCC AAG AGC AAA AAA CAG GGT AAC TCG TCC AAC AGC CGA CCT GCC CGC 822 
Ala Lys Ser Lys Lys Gin Gly Asn Ser Ser Asn Ser Arg Pro Ala Arg 
90 95 100 

GCC CTT TTC TGT TTA TCA CTC AAT AAC CCC ATC CGA AGA GCC TGC ATT 870 
Ala Leu Phe Cys Leu Ser Leu Asn Asn Pro lie Arg Arg Ala Cys lie 
105 110 115 120 

AGT ATA GTG GAA TGG AAA CCA TTT GAC ATA TTT ATA TTA TTG GCT ATT 918 
Ser lie Val Glu Trp Lys Pro Phe Asp lie Phe lie Leu Leu Ala lie 
125 130 135 
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TTT GCC AAT TGT GTG GCC TTA OCT ATT TAG ATC CCA TTC CCT GAA GAT 966 
Phe Ala Asn Cys Val Ala Leu Ala lie Tyr lie Pro Phe Pro Glu Asp 
140 145 150 

GAT TCT AAT TCA ACA AAT CAT AAC TTG GAA AAA GTA GAA TAT GCC TTC 1014 
Asp Ser Asn Ser Thr Asn His Asn Leu Glu Lys Val Glu Tyr Ala Phe 
155 160 165 

CTG ATT ATT TTT ACA GTC GAG ACA TTT TTG AAG ATT ATA GCG TAT GGA 1062 
Leu lie He Phe Thr Val Glu Thr Phe Leu Lys He He Ala Tyr Gly 
170 175 180 

TTA TTG CTA CAT CCT AAT GCT TAT GTT AGO AAT GGA TGG AAT TTA CTG 1110 
Leu Leu Leu His Pro Asn Ala Tyr Val Arg Asn Gly Trp Asn Leu I«eu 
185 190 195 200 

GAT TTT GTT ATA GTA ATA GTA GGA TTG TTT AGT GTA ATT TTG GAA CAA 1158 
Asp Phe Val He Val He Val Gly Leu Phe Ser Val He Leu Glu Gin 
205 210 215 

TTA ACC AAA GAA ACA GAA GGC GGG AAC CAC TCA AGC GGC AAA TCT GGA 1206 
Leu Thr Lys Glu Thr Glu Gly Gly Asn His Ser Ser Gly Lys Ser Gly 
220 225 230 

GGC TTT GAT GTC AAA GCC CTC CGT GCC TTT CGA GTG TTG CGA CCA CTT 1254 
Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg Val Leu Arg Pro Leu 
235 240 245 

CGA CTA GTG TCA GGA GTG CCC AGT TTA CAA GTT GTC CTG AAC TCC ATT 1302 
Arg Leu Val Ser Gly Val Pro Ser Leu Gin Val Val Leu Asn Ser He 
250 255 260 

ATA AAA GCC AT6 GTT CCC CTC CTT CAC ATA GCC CTT TTG GTA TTA TTT 1350 
He Lys Ala Met Val Pro Leu Leu His He Ala Leu Leu Val Leu Phe 
265 270 275 280 

GTA ATC ATA ATC TAT GCT ATT ATA GGA TTG GAA CTT TTT ATT GGA AAA 1398 
Val He He He Tyr Ala He He Gly Leu Glu Leu Phe He Gly Lys 
285 290 295 

ATG CAC AAA ACA TGT TTT TTT GCT GAC TCA GAT ATC GTA GCT GAA GAG 1446 
Met His Lys Thr Cys Phe Phe Ala Asp Ser Asp He Val Ala Glu Glu 
300 305 310 

GAC CCA GCT CCA TGT GCG TTC TCA GGG AAT GGA CGC CAG TGT ACT GCC 1494 
Asp Pro Ala Pro Cys Ala Phe Ser Gly Asn Gly Arg Gin Cys Thr Ala 
315 320 325 

AAT GGC ACG GAA TGT AGG AGT GGC TGG GTT GGC CCG AAC GGA GGC ATC 1542 
Asn Gly Thr Glu Cys Arg Ser Gly Trp Val Gly Pro Asn Gly Gly He 
330 335 340 

ACC AAC TTT GAT AAC TTT GCC TTT GCC ATG CTT ACT GTG TTT CAG TGC 1590 
Thr Asn Phe Asp Asn Phe Ala Phe Ala Met Leu Thr Val Phe Gin Cys 
345 350 355 360 
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ATC ACC ATG GAG GGC TGG ACA GAC GTG CTC TAC TGG ATG AAT GAT GCT 1638 
lie Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp Met Asn Asp Ala 
365 370 375 

ATG GGA TTT GAA TTG CCC TGG GTG TAT TTT GTC AGT CTC GTC ATC TTT 1686 
Met Gly Phe Glu Leu Pro Trp Val Tyr Phe Val Ser Leu Val lie Phe 
380 385 390 

GGG TCA TTT TTC GTA CTA AAT CTT GTA CTT GGT GTA TTG AGC GGA GAA 1734 
Gly Ser Phe Phe Val Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu 
395 400 405 

TTC TCA AAG GAA AGA GAG AAG GCA AAA 6CA CGG GGA GAT TTC CAG AAG 1782 
Phe Ser Lys Glu Arg Glu Lys Ala Lys Ala Arg Gly Asp Phe Gin Lys 
410 415 420 

CTC CGG GAG AAG CAG CAG CTG GAG GAG GAT CTA AAG GGC TAC TTG GAT 1830 
Leu Arg Glu Lys Gin Gin Leu Glu Glu Asp Leu Lys Gly Tyr Leu Asp 
425 430 435 440 

TGG ATC ACC CAA GCT GAG GAC ATC GAT CCG GAG AAT GAG GAA GAA GGA 1878 
Trp lie Thr Gin Ala Glu Asp lie Asp Pro Glu Asn Glu Glu Glu Gly 
445 450 455 

GGA GAG GAA GGC AAA CGA AAT ACT AGC ATG CCC ACC AGC GAG ACT GAG 1926 
Gly Glu Glu Gly Lys Arg Asn Thr Ser Met Pro Thr Ser Glu Thr Glu 
460 465 470 

TCT GTG AAC ACA GAG AAC GTC AGC GGT GAA GGC GAG AAC OGA GGC TGC 1974 
Ser Val Asn Thr Glu Asn Val Ser Gly Glu Gly Glu Asn Arg Gly Cys 
475 480 485 

TGT GGA AGT CTC TGT CAA GCC ATC TCA AAA TCC AAA CTC AGC CGA CGC 2022 
Cys Gly Ser I^eu Cys Gin Ala lie Ser Lys Ser Lys Leu Ser Arg Arg 
490 495 500 

TGG CGT CGC TGG AAC CGA TTC AAT CGC AGA AGA TGT AGG GCC GCC GTG 2070 
Trp Arg Arg Trp Asn Arg Phe Asn Arg Arg Arg Cys Arg Ala Ala Val 
505 510 515 520 

AAG TCT GTC ACG TTT TAC TGG CTG GTT ATC GTC CTG GTG TTT CTG AAC 2118 
Lys Ser Val Thr Phe Tyr Trp Leu Val He Val Leu Val Phe Leu Asn 
525 530 535 

ACC TTA ACC ATT TCC TCT GAG CAC TAC AAT CAG CCA GAT TGG TTG ACA 2166 
Thr Leu Thr He Ser Ser Glu His Tyr Asn Gin Pro Asp Trp Leu Thr 
540 545 550 

CAG ATT CAA GAT ATT GCC AAC AAA GTC CTC TTG GCT CTG TTC ACC TGC 2214 
Gin He Gin Asp He Ala Asn Lys Val Leu Leu Ala Leu Phe Thr Cys 
555 560 565 

GAG ATG CTG GTA AAA ATG TAC AGC TTG GGC CTC CAA GCA TAT TTC GTC 2262 
Glu Met Leu Val Lys Met Tyr Ser Leu Gly Leu Gin Ala Tyr Phe Val 
570 575 580 
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TCT CTT TTC AAC CGG TTT GAT TGC TTC GTG GTG TGT GOT GGA ATC ACT 2310 
Ser Leu Phe Asn Arg Phe Asp Cys Phe Val val Cys Gly 61y lie Thr 
585 590 595 600 

GAG AC6 ATC TTG GTG GAA CTG GAA ATC ATG TCT CCC CTG GGG ATC TCT 2358 
Glu Thr He Leu Val Glu Leu Glu He Met Ser Pro Leu Gly He Ser 
605 610 615 

GTG TTT CGG TGT GTG CGC CTC TTA AGA ATC TTC AAA GTG ACC AGG CAC 2406 
Val Phe Arg Cys Val Arg Leu Leu Arg He Phe Lys Val Thr Arg His 
620 625 630 

TGG ACT TCC CTG AGC AAC TTA GTG GCA TCC TTA TTA AAC TCC ATG AA6 2454 
Trp Thr Ser Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser Met Lys 
635 640 645 

TCC ATC GCT TCG CTG TTG CTT CTG CTT TTT CTC TTC ATT ATC ATC TTT 2502 
Ser He Ala Ser Leu I«eu Leu Leu Leu Phe Leu Phe He He He Phe 
650 655 660 

TCC TTG CTT GGG ATG CAG CTG TTT GGC GGC AAG TTT AAT TTT GAT GAA 2550 
Ser Leu Leu Gly Met Gin Leu Phe Gly Gly Lys Phe Asn Phe Asp Glu 
665 670 675 680 

ACG CAA ACC AAG CGG AGC ACC TTT GAC AAT TTC CCT CAA GCA CTT CTC 2598 
Thr Gin Thr Lys Arg Ser Thr Phe Asp Asn Phe Pro Gin Ala Leu Leu 
685 690 695 

ACA GTG TTC CAG ATC CTG ACA GGC GAA GAC TGG AAT GCT GTG ATG TAC 2646 
Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp Asn Ala Val Met Tyr 
700 705 710 

GAT GGC ATC ATG GCT TAC GGG GGC CCA TCC TCT TCA GGA ATG ATC GTC 2694 
Asp Gly He Met Ala Tyr Gly Gly Pro Ser Ser Ser Gly Net He Val 
715 720 725 

TGC ATC TAC TTC ATC ATC CTC TTC ATT TGT GGT AAC TAT ATT CTA CTG 2742 
Cys He Tyr Phe He He Leu Phe He Cys Gly Asn Tyr He Leu Leu 
730 735 740 

AAT GTC TTC TTG GCC ATC GCT GTA GAC AAT TTG GCT GAT GCT GAA AGT 2790 
Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala Asp Ala Glu Ser 
745 750 755 760 

CTG AAC ACT GCT CAG AAA GAA GAA GCG GAA GAA AAG GAG AGG AAA AAG 2838 
Leu Asn Thr Ala Gin Lys Glu Glu Ala Glu Glu Lys Glu Arg Lys Lys 
765 770 775 

ATT GCC AGA AAA GAG AGC CTA GAA AAT AAA AAG AAC AAC AAA CCA GAA 2886 
He Ala Arg Lys Glu Ser Leu Glu Asn Lys Lys Asn Asn Lys Pro Glu 
780 785 790 

GTC AAC CAG ATA GCC AAC AGT GAC AAC AAG GTT ACA ATT GAT GAC TAT 2934 
Val Asn Gin He Ala Asn Ser Asp Asn Lys Val Thr He Asp Asp Tyr 
795 800 805 
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AGA GAA GAG GAT GAA GAC AAG GAC CCC TAT CCG CCT TGC GAT GTG CCA 2982 
Arg Glu Glu Asp Glu Asp Lys Hsp Pro Tyr Pro Pro Cys Asp Val Pro 
810 815 820 

GTA GGG GAA GAG GAA GAG GAA GAG GAG GAG GAT GAA CCT GAG 6TT CCT 3030 
Val Gly Glu Glu Glu Glu Glu Glu Glu Glu Asp Glu Pro Glu Val Pro 
825 830 835 840 

GCC GGA CCC CGT CCT CGA AGG ATC TCG GAG TTG AAC ATG AAG GAA AAA 3078 
Ala Gly Pro Arg Pro Arg Arg lie Ser Glu Leu Asn Net Lys Glu Lys 
845 850 855 

ATT GCC CCC ATC CCT GAA GGG AGC GCT TTC TTC ATT CTT AGC AAG ACC 3126 
lie Ala Pro He Pro Glu Gly Ser Ala Phe Phe He Leu Ser Lys Thr 
860 865 870 

AAC CCG ATC CGC GTA GGC TGC CAC AAG CTC ATC AAC CAC CAC ATC TTC 3174 
Asn Pro He Arg Val Gly Cys Bis Lys Leu He Asn His His He Phe 
875 880 885 

ACC AAC CTC ATC CTT GTC TTC ATC ATG CTG AGC AGT GCT GCC CTG GCC 3222 
Thr Asn Leu He Leu Val Phe He Net Leu Ser Ser Ala Ala Leu Ala 
890 895 900 

GCA GAG GAC CCC ATC CGC AGC CAC TCC TTC CG6 AAC ACG ATA CTG GGT 3270 
Ala Glu Asp Pro He Arg Ser His Ser Phe Arg Asn Thr He Leu Gly 
905 910 915 920 

TAC TTT GAC TAT GCC TTC ACA GCC ATC TTT ACT GTT GAG ATC CTG TTG 3318 
Tyr Phe Asp Tyr Ala Phe Thr Ala He Phe Thr Val Glu He Leu Leu 
925 930 935 

AAG ATG ACA ACT TTT GGA GCT TTC CTC CAC AAA GGG GCC TTC TGC AGG 3366 
Lys Net Thr Thr Phe Gly Ala Phe Leu His Lys Gly Ala Phe Cys Arg 
940 945 950 

AAC TAC TTC AAT TTG CTG GAT ATG CTG GTG GTT GGG GTG TCT CTG GTG 3414 
Asn Tyr Phe Asn Leu Leu Asp Net Leu Val Val Gly Val Ser Leu Val 
955 960 965 

TCA TTT GGG ATT CAA TCC AGT GCC ATC TCC GTT GTG AAG ATT CTG AGG 3462 
Ser Phe Gly He Gin Ser Ser Ala He Ser Val Val Lys He Leu Arg 
970 975 980 

GTC TTA AGG GTC CTG CGT CCC CTC AGG GCC ATC AAC AGA GCA AAA GGA 3510 
Val Leu Arg Val Leu Arg Pro Leu Arg Ala He Asn Arg Ala Lys Gly 
985 990 995 1000 

CTT AAG CAC GTG GTC CAG TGC GTC TTC GTG GCC ATC CGG ACC ATC GGC 3558 
Leu Lys His Val Val Gin Cys Val Phe Val Ala He Arg Thr He Gly 
1005 1010 1015 

AAC ATC ATG ATC GTC ACC ACC CTC CTG CAG TTC ATG TTT GCC TGT ATC 3606 
Asn He Net He Val Thr Thr Leu Leu Gin Phe Net Phe Ala Cys He 
1020 1025 1030 
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GGG GTC CAG TTG TTC AAG GGG AAG TTC TAT CGC TGT ACQ GAT GAA GCC 3654 
Gly Val Gin Leu Phe Lys Gly Lys Phe Tyr Arg Cys Thr Asp Glu Ala 
1035 1040 1045 

AAA A6T AAC CCT GAA GAA TGC AGG GGA CTT TTC ATC CTC TAC AAG GAT 3702 
Lys Ser Asn Pro Glu Glu Cys Arg Gly Leu Phe lie Leu Tyr Lys Asp 
1050 1055 1060 

GGG GAT GTT GAC AGT CCT GTG GTC CGT GAA CGG ATC TGG CAA AAC A6T 3750 
Gly Asp Val Asp Ser Pro Val Val Arg Glu Arg lie Trp Gin Asn Ser 
1065 1070 1075 1080 

GAT TTC AAC TTC GAC AAC GTC CTC TCT GCT ATG ATQ GCG CTC TTC ACA 3798 
Asp Phe Asn Phe Asp Asn Val Leu Ser Ala Met Met Ala Leu Phe Thr 
1085 1090 1095 

GTC TCC ACG TTT GAG GGC TGG CCT GCG TTG CTG TAT A/J^ GCC ATC GAC 3846 
Val Ser Thr Phe Glu Gly Trp Pro Ala Leu Leu Tyr Lys Ala He Asp 
1100 1105 1110 

TCG AAT GGA GAG AAC ATC GGC CCA ATC TAC AAC CAC CGC GTG GAG ATC 3894 
Ser Asn Gly Glu Asn He Gly Pro He Tyr Asn His Arg Val Glu He 
1115 1120 1125 

TCC ATC TTC TTC ATC ATC TAC ATC ATC ATT GTA GCT TTC TTC ATG ATG 3942 
Ser He Phe Phe He He Tyr He He He Val Ala Phe Phe Met Met 
1130 1135 1140 

AAC ATC TTT GTG GGC TTT GTC ATC GTT ACA TTT CAG G7VA CAA GGA GAA 3990 
Asn He Phe Val Gly Phe Val He Val Thr Phe Gin Glu Gin Gly Glu 
1145 1150 1155 1160 

AAA GAG TAT AAG AAC TGT GAG CTG GAC AAA AAT CAG CGT CAG TGT GTT 4038 
Lys Glu Tyr Lys Asn Cys Glu Leu Asp Lys Asn Gin Arg Gin Cys Val 
1165 1170 1175 

GAA TAC GCC TTG AAA GCA CGT CCC TTG CGG AGA TAC ATC CCC AAA AAC 4086 
Glu Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr He Pro Lys Asn 
1180 1185 1190 

CCC TAC CAG TAC AAG TTC TGG TAC GTG GTG AAC TCT TCG CCT TTC GAA 4134 
Pro Tyr Gin Tyr Lys Phe Trp Tyr Val Val Asn Ser Ser Pro Phe Glu 
1195 1200 1205 

TAC ATG ATG TTT GTC CTC ATC ATG CTC AAC ACA CTC TGC TTG GCC ATG 4182 
Tyr Met Met Phe Val Leu He Met Leu Asn Thr Leu Cys Leu Ala Met 
1210 1215 1220 

CAG CAC TAC GAG CAG TCC AAG ATG TTC AAT GAT GCC ATG GAC ATT CTG 4230 
Gin His Tyr Glu Gin Ser Lys Met Phe Asn Asp Ala Met Asp He^ Leu 
1225 1230 1235 1240 

AAC ATG GTC TTC ACC GGG GTG TTC ACC GTC GAG ATG GTT TTG AAA GTC 4278 
Asn Met Val Phe Thr Gly Val Phe Thr Val Glu Met Val Leu Lys Val 
1245 1250 1255 
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ATC GCA TTT AAG CCT AAG GGG TAT TTT ACT GAC GCC TGG AAC ACQ TTT 4326 
lie Ala Phe Lys Pro Lys Gly Tyr Phe Ser Asp Ala Trp Asn Thr Phe 
1260 1265 1270 

GAC TCC CTC ATC GTA ATC GGC AGC ATT ATA GAC GTG GCC CTC AGC GAA 4374 
Asp Ser Leu He Val He Gly Ser He He Asp Val Ala Leu Ser Glu 
1275 1280 1285 

GCA GAC CCA ACT GAA AGT GAA AAT GTC CCT GTC CCA ACT GCT ACA CCT 4422 
Ala Asp Pro Thr Glu Ser Glu Asn Val Pro Val Pro Thr Ala Thr Pro 
1290 1295 1300 

GGG AAC TCT GAA GAG AGC AAT AGA ATC TCC ATC ACC TTT TTC CGT CTT 4470 
Gly Asn Ser Glu Glu Ser Asn Arg He Ser He Thr Phe Phe Arg Leu 
1305 1310 1315 1320 

TTC CGA GTG ATG CGA TTG GTG AAG CTT CTC AGC AGG GGG GAA GGC ATC 4518 
Phe Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly Glu Gly He 
1325 1330 1335 

CGG ACA TTG CTG TGG ACT TTT ATT AAG TTC TTT CAG GCG CTC COG TAT 4566 
Arg Thr Leu Leu Trp Thr Phe He Lys Phe Phe Gin Ala Leu Pro Tyr 
1340 1345 1350 

GTG GCC CTC CTC ATA GCC ATG CTG TTC TTC ATC TAT GCG GTC ATT GGC 4614 
Val Ala Leu Leu He Ala Met Leu Phe Phe He Tyr Ala Val He Gly 
1355 1360 1365 

ATG CAG ATG TTT GGG AAA GTT GCC ATG AGA GAT AAC AAC CAG ATC AAT 4662 
Met Gin Met Phe Gly Lys Val Ala Met Arg Asp Asn Asn Gin He Asn 
1370 1375 13B0 

AGG AAC AAT AAC TTC CAG ACQ TTT GCC CAG GCG GTG CTG CTG CTC TTC 4710 
Arg Asn Asn Asn Phe Gin Thr Phe Pro Gin Ala Val Leu Leu Leu Phe 
1385 1390 1395 1400 

AGG TGT GCA ACA GGT GAG GCC TGG CAS GAG ATC ATG CTG GCC TGT CTC 4758 
Arg Cys Ala Thr Gly Glu Ala Trp Gin Glu He Met Leu Ala Cys Leu 
1405 1410 1415 

CCA GGG AAG CTC TGT GAC CCT GAG TCA GAT TAC AAC CCC GGG GAG GAG 4806 
Pro Gly Lys Leu Cys Asp Pro Glu Ser Asp Tyr Asn Pro Gly Glu Glu 
1420 1425 1430 

CAT ACA TGT GGG AGC AAC TTT GCC ATT GTC TAT TTC ATC AGT TTT TAC 4854 
His Thr Cys Gly Ser Asn Phe Ala He Val Tyr Phe He Ser Phe Tyr 
1435 1440 1445 

ATG CTC TGT GCA TTT CTG ATC ATC AAT CTG TTT GTG GCT GTC ATC ATG 4902 
Met Leu Cys Ala Phe Leu He He Asn Leu Phe Val Ala Val He Met 
1450 1455 1460 

GAT AAT TTC GAC TAT CTG ACC CGG GAC TGG TCT ATT TTG GGG CCT CAC 4950 
Asp Asn Phe Asp Tyr Leu Thr Arg Asp Trp Ser He Leu Gly Pro His 
1465 1470 1475 1480 



-123- 



CAT TTA GAT GAA TTC AAA AGA ATA TGG TCA GAA TAT GAC CCT GAG GCA 4998 
His Leu Asp Glu Phe Lys Arg He Trp Ser Glu Tyr Asp Pro Glu Ala 
1485 1490 1495 

AAG 6GA AGG ATA AAA CAC CTT GAT GTG GTC ACT CTG CTT CGA CGC ATC 5046 
Lys Gly Arg He Lys His Leu Asp Val Val Thr Leu Leu Arg Arg He 
1500 1505 1510 

CAG CCT CCC CTG GGG TTT GGG AAG TTA TGT CCA CAC AGG GTA GCG TGC 5094 
Gin Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys 
1515 1520 1525 

AAG AGA TTA GTT GCC ATG AAC ATG CCT CTC AAC AGT GAC GGG ACA GTC 5142 
Lys Arg Leu Val Ala Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val 
1530 1535 1540 

ATG TTT AAT GCA ACC CTG TTT GCT TTG GTT CGA ACG GCT CTT AAG ATC 5190 
Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Lys He 
1545 1550 1555 1560 

AAG ACC GAA GGG AAC CTG GAG CAA GCT AAT GAA GAA CTT CGG GCT GTG 5238 
Lys Thr Glu Gly Asn Leu Glu Gin Ala Asn Glu Glu Leu Arg Ala Val 
1565 1570 1575 

ATA AAG AAA ATT TGG AAG AAA ACC AGC ATG AAA TTA CTT GAC CAA GTT 5286 
He Lys Lys He Trp Lys Lys Thr Ser Met Lys Leu Leu Asp Gin Val 
1580 1585 1590 

GTC CCT CCA GCT GGT GAT GAT GAG GTA ACC GTG GGG AAG TTC TAT GCC 5334 
Val Pro Pro Ala Oly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr Ala 
1595 1600 1605 

ACT TTC CTG ATA CAG GAC TAC TTT AGG AAA TTC AAG AAA CGG AAA GAA 5382 
Thr Phe Leu He Gin Asp Tyr Phe Arg Lys Phe Lys Lys Arg Lys Glu 
1610 1615 1620 

CAA GGA CTG GTG GGA AAG TAC CCT GCG AAG AAC ACC ACA ATT GCC CTA 5430 
Gin Gly Leu Val Gly Lys Tyr Pro Ala Lys Asn Thr Thr He Ala Leu 
1625 1630 1635 1640 

CAG GCG GGA TTA AGG ACA CTG CAT GAC ATT GGG CCA GAA ATC CGG CGT 5478 
Gin Ala Gly Leu Arg Thr Leu His Asp He Gly Pro Glu He Arg Arg 
1645 1650 1655 

GCT ATA TCG TGT GAT TTG CAA GAT GAC GAG CCT GAG GAA ACA AAA CGA 5526 
Ala He Ser Cys Asp Leu Gin Asp Asp Glu Pro Glu Glu Thr Lys Arg 
1660 1665 1670 

GAA GAA GAA GAT GAT GTG TTC AAA AGA AAT GGT GCC CTG CTT GGA AAC 5574 
Glu Glu Glu Asp Asp Val Phe Lys Arg Asn Gly Ala Leu Leu Gly Asn 
1675 1680 1685 

CAT GTC AAT CAT GTT AAT AGT GAT AGG AGA GAT TCC CTT CAG CAG ACC 5622 
His Val Asn His Val Asn Ser Asp Arg Arg Asp Ser Leu Gin Gin Thr 
1690 1695 1700 
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AAT ACC ACC CAC CGT CCC CTG CAT GTC CAA AGG CCT TCA ATT CCA CCT 5670 
Asn Thr Thr His Arg Pro Leu His val Gin Arg Pro Ser He Pro Pro 
1705 1710 1715 1720 

GCA AGT GAT ACT GAG AAA CCG CTG TTT CCT CCA GCA 6GA AAT TCG GTG 5718 
Ala Ser Asp Thr Glu Lys Pro Leu Phe Pro Pro Ala Gly Asn Ser Val 
1725 1730 1735 

TGT CAT AAC CAT CAT AAC CAT AAT TCC ATA GGA AAG CAA GTT CCC ACC 5766 
Cys His Asn His His Asn His Asn Ser He Gly Lys Gin Val Pro Thr 
1740 1745 1750 

TCA ACA AAT GCC AAT CTC AAT AAT GCC AAT ATG TCC AAA GCT GCC CAT 5814 
Ser Tfhr Asn Ala Asn Leu Asn Asn Ala Asn Met Ser Lys Ala Ala His 
1755 1760 1765 

GGA AAG CG6 CCC AGC ATT GGG AAC CTT GAG CAT GTG TCT GAA AAT GGG 5862 
Gly Lys Arg Pro Ser He Gly Asn Leu Glu His Val Ser Glu Asn Gly 
1770 1775 1780 

CAT CAT TCT TCC CAC AAG CAT GAC CGG GAG CCT CAG AGA AGG TCC AGT 5910 
His His Ser Ser His Lys His Asp Arg Glu Pro Gin Arg Arg Ser Ser 
1785 1790 1795 1800 

GTG AAA AGA ACC CGC TAT TAT GAA ACT TAC ATT AGG TCC GAC TCA GGA 5958 
Val Lys Arg Thr Arg Tyr Tyr Glu Thr Tyr He Arg Ser Asp Ser Gly 
1805 1810 1815 

GAT GAA CAG CTC CCA ACT ATT TGC CGG GAA GAC CCA GAG ATA CAT GGC 6006 
Asp Glu Gin Leu Pro Thr He Cys Arg Glu Asp Pro Glu He His Gly 
1820 1825 1830 

TAT TTC AGG GAC CCC CAC TGC TTG GGG GAG CAG GAG TAT TTC AGT AGT 6054 
Tyr Phe Arg Asp Pro His Cys Leu Gly Glu Gin Glu Tyr Phe Ser Ser 
1835 1840 1845 

GAG GAA TGC TAC GAG GAT GAC 7^ TCG CCC ACC TGG AGC AOG CAA AAC 6102 
Glu Glu Cys Tyr Glu Asp Asp Ser Ser Pro Thr Trp Ser Arg Gin Asn 
1850 1855 1860 

TAT GGC TAC TAC AGC AGA TAC CCA GGC AGA AAC ATC GAC TCT GAG AGG 6150 
Tyr Gly Tyr Tyr Ser Arg Tyr Pro Gly Arg Asn He Asp Ser Glu Arg 
1865 1870 1875 1880 

CCC CGA GGC TAC CAT CAT CCC CAA GGA TTC TTG GAG GAC GAT GAC TCG 6198 
Pro Arg Gly Tyr His His Pro Gin Gly Phe lieu Glu Asp Asp Asp Ser 
1885 1690 1895 

CCC GTT TGC TAT GAT TCA CGG AGA TCT CCA AGG AGA CGC CTA CTA CCT 6246 
Pro Val Cys Tyr Asp Ser Arg Arg Ser Pro Arg Arg Arg Leu Leu Pro 
1900 1905 1910 

CCC ACC CCA GCA TCC CAC CGG AGA TCC TCC TTC AAC TTT GAG TGC CTG 6294 
Pro Thr Pro Ala Ser His Arg Arg Ser Ser Phe Asn Phe Glu Cys Leu 
1915 1920 1925 
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CX3C CGG CAG AGC AGC CAG GAA GAG GTC CCG TCG TCT CCC ATC TTC CCC 6342 
Arg Arg Gin Ser Ser Gin Glu Glu Val Pro- Ser Ser Pro lie Phe Pro 
1930 1935 1940 



CAT CGC 
His Arg 
1945 


ACG GCC 
Thr Ala 


CTG 
Leu 


CCT CTG 
Pro Leu 
1950 


CAT 
His 


CTA ATG 
Leu Met 


CAG CAA 
Gin Gin 
1955 


CAG 

Gin 


ATC 
He 


ATG 
Met 


GCA 
Ala 
1960 


6390 


GTT 
Val 


GCC 
Ala 


GGC CTA GAT TCA 
Gly Leu Asp Ser 
1965 


AGT 
Ser 


AAA GCC C/IG AAG 
Lys Ala Gin Lys 
1970 


TAC 
Tyr 


TCA 

Ser 


CCG 
Pro 


AGT CAC 
Ser His 
1975 


6438 


TCG 
Ser 


ACC 
Thr 


CGG TCG TGG 
Arg Ser Trp 
1980 


GCC 
Ala 


ACC 
Thr 


CCT 
Pro 


CCA GCA 
Pro Ala 
1965 


ACC 
Thr 


CCT 
Pro 


CCC 
Pro 


TAC CGG 
Tyr Arg 
1990 


GAC 
Asp 


6486 


TGG 
Trp 


ACA 
Thr 


CCG TGC 
Pro Cys 
1995 


TAC 
Tyr 


ACC 
Thr 


CCC 
Pro 


CTG ATC 
Leu He 
2000 


CAA 
Gin 


GTG 
Val 


GAG 
Glu 


CAG TCA 
Gin Ser 
2005 


GAG 
Glu 


GCC 
Ala 


6534 


CTG 
Leu 


GAC CAG 
Asp Gin 
2010 


GTG 
Val 


AAC 
Asn 


GGC 
Gly 


AGC CTG 
Ser Leu 
2015 


CCG 
Pro 


TCC 
ser 


CTG 
Leu 


CAC CGC 
His Arg 
2020 


AGC 
Ser 


TCC 
Ser 


TGG 
Trp 


6582 


TAC ACA 
Tyr Thr 
2025 


GAC 
Asp 


GAG 
Glu 


CCC 
Pro 


GAC ATC 
Asp lie 
2030 


TCC 
Ser 


TAC 
Tyr 


CGG 
Arg 


ACT TTC 
Thr Phe 
2035 


ACA 

Thr 


CCA 
Prx> 


GCC 
Ala 


AGC 
Ser 
2040 


6630 


CTG 
Leu 


ACT 
Thr 


GTC 
Val 


CCC 
Pro 


AGC AGC 
Ser Ser 
2045 


TTC 
Phe 


CGG 
Arg 


AAC 
Asn 


AAA AAC 
Lys Asn 
2050 


AGC 
Ser 


GAC 
Asp 


AAG 
Lys 


CAG AGG 
Gin Arg 
2055 


6678 


AGT 
Ser 


6CG 
Ala 


GAC 
Asp 


AGC TTG 
Ser Leu 
2060 


GTG 
Val 


GAG 
Glu 


GCA 
Ala 


GTC CTG 
Val Leu 
2065 


ATA 
He 


TCC 
Ser 


GAA 
Glu 


GGC TTG 
Gly Leu 
2070 


GGA 
Gly 


6726 


CGC 
Arg 


TAT 
Tyr 


GCA AGG 
Ala Arg 
2075 


GAC 
Asp 


CCA 
Pro 


AAA 

Lys 


TTT GTG 
Phe Val 
2080 


TCA 
Ser 


GCA 
Ala 


ACA 
Thr 


AAA CAC 
Lys His 
2085 


GAA 
Glu 


ATC 
He 


6774 


GCT 
Ala 


GAT GCC 
Asp Ala 
2090 


TGT 
Cys 


GAC 
Asp 


CTC 
Leu 


ACC ATC GAC 
Thr lie Asp 
2095 


GAG 
Glu 


ATG 
Net 


GAG AGT GCA 
Glu Ser Ala 
2100 


GCC 
Ala 


AGC 
Ser 


6822 


ACC CTG 
Thr Leu 
2105 


CTT 
Leu 


AAT 
Asn 


GGG 
Gly 


AAC GTG 
Asn Val 
2110 


CGT CCC 
Arg Pro 


CGA 
Arg 


GCC AAC GGG GAT 
Ala Asn Gly Asp 
2115 


GTG 
Val 


GGC 
Gly 
2120 


6870 


CCC 
Pro 


CTC 
Leu 


TCA 
Ser 


CAC 
His 


CGG CAG GAC 
Arg Gin Asp 
2125 


TAT GAG 
Tyr Glu 


CTA CAG GAC TTT GGT 
Leu Gin Asp Phe Gly 
2130 


CCT GGC 
Pro Gly 
2135 


6918 


TAC 


AGC 


GAC 


GAA 


GAG 


CCA 


GAC 


CCT 


GGG 


AGG 


GAT 


GAG 


GAG 


GAC 


CTG 


GCG 


6966 



Tyr Ser Asp Glu Glu Pro Asp Pro Gly Arg Asp Glu Glu Asp Leu Ala 
2140 2145 2150 
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GAT GAA ATG ATA TGC ATC ACC ACC TTG TAGCCCCCAG CGAGGG6CAG 7013 
Asp Glu Met He Cys He Thr Thr Leu 
2155 2160 

ACTGGCTCTG GCCTCAGGTG GGGCGCAGGA GAGCCAGGGG AAAAGTGCCT CATAGTTAGG 7073 

AAAGTTTAGG CACTAGTTGG GAGTAATATT CAATTAATTA GACTTTTGTA TAAGAGATGT 7133 

CATGCCTCAA GAAAGCCATA AACCTGGTAG GAACAGGTCC CAAGCGGTTG AGCCTGGCAG 7193 

AGTACCATGC 6CTCGGCCCC AGCTGCAGGA AACAGCAGGC CCCGCCCTCT CACAGA8GAT 7253 

GGGTGAGGAG GCCAGACCTG CCCTGCCCCA TT6TCCAGAT GGGCACTGCT 6TGGAGTCTG 7313 

CTTCTCCCAT GTACCAGGGC ACCA6GCCCA CCCAACTGAA GGCATGGCGG CXK3GGTGCAG 7373 

GG6AAAGTTA AAGGT6ATGA CGATCATCAC ACCT6TGTCG TTACCTCAGC CATCGGTCTA 7433 

QCMATCaXST CACTGGGCCC AACATATCCA TTTTTAAACC CTTTCXTCCCA AATACSICTGC 7493 

GTCCTGGTTC CTGTTTAGCT GTTCTGAAAT ACGGTGTGTA AGTAAGTCAG AACCCAGCTA 7553 

CCAGTGATTA TTGCGAGGGC AATGGGACCT CATAAATAAG GTTTTCTGTG ATGTGAC6CC 7613 

AGTTTACATA AGAGAATATC AC 7635 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .102 

(ix) FEATURE: 

(A) NAME/KEY : misc^feature 

(B) LOCATION: 1.-104 

(D) OTHER INFORMATION: /note* "A 104 -nucleotide 
alternative exon of alpha- ID." 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

GTA AAT GAT GCG ATA GGA TGG GAA TGG CCA TGG GTG TAT TTT GTT AGT 48 
Val Asn Asp Ala He Gly Trp Glu Trp Pro Trp Val Tyr Phe Val Ser 
15 10 15 

CTG ATC ATC CTT GGC TCA TTT TTC GTC CTT AAC CTG GTT CTT GGT GTC 96 
Leu He He Leu Gly Ser Phe Phe Val Leu Asn Leu Val Leu Gly Val 
20 25 30 
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CTT AGT GG 
Leu Ser 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IjENGTH: 6575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCnLE TYPE: DNA (genomic) 

(ix) FBATORE: 

(A) NAME/KEY: CDS 

(B) LOCATION: l.«6492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG 6TC AAT GAG AAT ACG AGG ATG TAC ATT CCA GAG GAA AAC CAC CAA 
Met Val Asn Glu Asn Thr Arg Met Tyr lie Pro Glu Glu Asn His Gin 
1 5 10 15 

GGT TCC AAC TAT GGG AGC CCA CGC CCC GCC CAT GCC AAC ATG AAT GCC 
Gly Ser Asn Tyr Gly Ser Pro Arg Pro Ala His Ala Asn Net Asn Ala 
20 25 30 

AAT GCG GCA GCG GGG CTG GCC CCT GAG CAC ATC CCC ACC CCG GGG GCT 
Asn Ala Ala Ala Gly Leu Ala Pro Glu His He Pro Thr Pro Gly Ala 
35 40 45 

GCC CTG TCG TGG CAG GCG GCC ATC GAC GCA GCC C66 CAG GCT AAG CTG 
Ala Leu Ser Trp Gin Ala Ala He Asp Ala Ala Arg Gin Ala Lys Leu 
50 55 60 

ATG G6C AGC GCT GGC AAT GCG ACC ATC TCC ACA GTC AGC TCC ACG CAG 
Met Gly Ser Ala Gly Asn Ala Thr He Ser Thr Val Ser Ser Thr Gin 
65 70 75 80 

CGG AAG CGC CAG CAA TAT GGG AAA CCC AAG AAG CAG GGC AGC ACC ACG 
Arg Lys Arg Gin Gin Tyr Gly Lys Pro Lys Lys Gin Gly Ser Thr Thr 
65 90 95 

GCC ACA CGC CCG CCC CGA GCC CTG CTC TGC CTG ACC CTG AAG AAC CCC 
Ala Thr Arg Pro Pro Arg Ala Leu Leu Cys Leu Thr Leu Lys Asn Pro 
100 105 110 



104 



48 



96 



144 



192 



240 



288 



336 



ATC CGG AGG GCC TGC ATC AGC ATT GTC GAA TGG AAA CCA TTT GAA ATA 
He Arg Arg Ala Cys He Ser He Val Glu Trp Lys Pro Phe Glu He 
115 120 125 



384 



ATT ATT TTA CTG ACT ATT TTT GCC AAT TGT GTG GCC TTA GCG ATC TAT 
He He Leu Leu Thr He Phe Ala Asn Cys Val Ala Leu Ala He Tyr 
130 135 140 



432 



ATT CCC TTT CCA GAA GAT GAT TCC AAC GCC ACC AAT TCC AAC CTG GAA 



480 
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lie Pro Phe Pro Glu Asp Asp Ser Asn Ala Thr Asn Ser Asn Leu Glu 
145 150 155 160 

CGA GTG GAA TAT CTC TTT CTC ATA ATT TTT ACG GTG GAA GCG TTT TTA 528 
Arg Val Glu Tyr Leu Phe Leu lie He Phe Thr Val Glu Ala Phe Leu 
165 170 175 

AAA GTA ATC GCC TAT GGA CTC CTC TTT CAC CCC AAT GCC TAC CTC CGC 576 
Lys Val He Ala Tyr Gly Leu Leu Phe His Pro Asn Ala Tyr Leu Arg 
180 185 190 

AAC GGC TGG 7VAC CTA CTA GAT TTT ATA ATT GTG GTT GTG GGG CTT TTT 624 
Asn Gly Trp Asn Leu Leu Asp Phe He He Val Val Val Gly Leu Phe 
195 200 205 

AGT GCA ATT TTA GAA CAA GCA ACC AAA OCA GAT GGG GCA AAC GCT CTC 672 
Ser Ala He Leu Glu Gin Ala Thr Lys Ala Asp Gly Ala Asn Ala Leu 
210 215 220 

GGA GGG AAA GGG GCC GGA TTT GAT GTG AA6 GCG CTO AG6 GCC TTC CGC 720 
Gly Gly Lys Gly Ala Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg 
225 230 235 240 

GTG CTG CGC CCC CTG C6G CTG GTG TCC GGA GTC CCA AGT CTC CAG GTG 768 
Val Leu Arg Pro Leu Arg Leu Val Ser Gly Val Pro Ser Leu Gin Val 
245 250 255 

GTC CTG AAT TCC ATC ATC AAG GCC ATG GTC CCC CTG CTG CAC ATC GCC 816 
Val Leu Asn Ser He He Lys Ala Met Val Pro Leu Leu His He Ala 
260 265 270 

CTG CTT GTG CTG TTT GTC ATC ATC ATC TAC GCC ATC ATC GGC TTG GAG 864 
Leu Leu Val Leu Phe Val He He He Tyr Ala He He Gly Leu Glu 
275 280 285 

CTC TTC ATG GGG AAG ATG CAC AAG ACC TGC TAC AAC CAG GAG GGC ATA 912 
Leu Phe Net Gly Lys Met His Lys Thr Cys Tyr Asn Gin Glu Gly He 
290 295 300 

GCA GAT GTT CCA GCA GAA GAT GAC CCT TCC CCT TGT GCG CTG GAA ACG 960 
Ala Asp Val Pro Ala Glu Asp Asp Pro Ser Pro Cys Ala Leu Glu Thr 
305 310 315 320 

GGC CAC GGG CGG CAG TGC CAG AAC GGC ACG GTG TGC AAG CCC GGC TGG 1008 
Gly His Gly Arg Gin Cys Gin Asn Gly Thr Val Cys Lys Pro Gly Trp 
325 330 335 

GAT GGT CCC AAG CAC GGC ATC ACC AAC TTT GAC AAC TTT GCC TTC GCC 1056 
Asp Gly Pro Lys His Gly He Thr Asn Phe Asp Asn Phe Ala Phe Ala 
340 345 350 

ATG CTC ACG GTG TTC CAG TGC ATC ACC ATG GAG GGC TGG ACG GAC GTG 1104 
Met Leu Thr Val Phe Gin Cys He Thr Met Glu Gly Trp Thr Asp Val 
355 360 365 

CTG TAC TGG GTC AAT GAT GCC GTA GGA AGG GAC TGG CCC TGG ATC TAT 1152 
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Leu Tyr Trp Val Asn Asp Ala Val Gly Arg Asp Trp Pro Trp lie Tyr 
370 375 380 

TTT GTT ACA CTA ATC ATC ATA GGG TCA TTT TTT GTA CTT AAC TTG GTT 1200 
Phe Val Thr X.eu lie lie He Gly Ser Phe Phe Val Leu Asn Leu Val 
385 390 335 400 

CTC GGT GTG CTT AGC GGA GAG TTT TCC AAA GAG AGG GAG AAG GCC AAG 1248 
Leu Gly Val Leu ser Gly Glu Phe Ser Lys Glu Arg Glu Lys Ala Lys 
405 410 415 

GCC CGG GGA GAT TTC GAG AAG CTG CGG GAG AAG CAG GAG CTA GAA GAG 1296 
Ala Arg Gly Asp Phe Gin Lys Leu Arg Glu Lys Gin Gin Leu Glu Glu 
420 425 430 

GAT CTC AAA GGC TAC CTG GAT TGG ATC ACT CAG GCC GAA GAC ATC GAT 1344 
Asp Leu Lys Gly Tyr lieu Asp Trp He Thr Gin Ala Glu Asp He Asp 
435 440 445 

CCT GAG AAT GAG GAC GAA GGC ATG GAT GAG GAG AAG CCC CGA AAC AGA 1392 
Pro Glu Asn Glu Asp Glu Gly Met Asp Glu Glu Lys Pro Arg Asn Arg 
450 455 460 

GGC ACT CCG GCG GGC ATG CTT GAT CAG AAG AAA GGG AAG TTT GCT TGG 1440 
Gly Thr Pro Ala Gly Net Leu Asp Gin Lys Lys Gly Lys Phe Ala Trp 
465 470 475 480 

TTT AGT CAC TCC ACA GAA ACC CAT GTG AGC ATG CCC ACC AGT GAG ACC 1488 
Phe Ser His Ser Thr Glu Thr His Val Ser Met Pro Thr Ser Glu Thr 
485 490 495 

GAG TCC GTC AAC ACC GAA AAC GTG GCT GGA GGT GAC ATC GAG GGA GAA 1536 
Glu Ser Val Asn Thr Glu Asn Val Ala Gly Gly Asp He Glu Gly Glu 
500 505 510 

AAC T6C GGG GCC AGG CTG GCC CAC CGG ATC TCC AAG TCA AAG TTC AGC 1584 
Asn Cys Gly Ala Arg Leu Ala His Arg He Ser Lys Ser Lys Phe Ser 
515 520 525 

CGC TAC TGG CGC CGG TGG AAT CGG TTC T6C AGA AGG AAG TGC CGC GCC 1632 
Arg Tyr Trp Arg Arg Trp Asn Arg Phe Cys Arg Arg Lys Cys Arg Ala 
530 535 540 

GGA GTC AAG TCT AAT GTC TTC TAC TGG CTG GTG ATT TTC CTG GTG TTC 1680 
Ala Val Lys Ser Asn Val Phe Tyr Trp Leu Val He Phe Leu Val Phe 
545 550 555 560 

CTC AAC ACG CTC ACC ATT GCC TCT GAG CAC TAC AAC CAG CCC AAC TGG 1728 
Leu Asn Thr Leu Thr He ALr Ser Glu His Tyr Asn Gin Pro Asn Trp 
565 570 575 

CTC ACA GAA GTC CAA GAC ACG GCA AAC AAG GCC CTG CTG GCC CTG TTC 1776 
Leu Thr Glu Val Gin Asp Thr Ala Asn Lys Ala Leu Leu Ala Leu Phe 
580 585 590 

ACG GCA GAG ATG CTC CTG AAG ATG TAC AGC CTG GGC CTG CAG GCC TAC 1824 
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Thr Ala Glu Met Leu Leu Lys Met Tyr Ser Leu Gly Leu Gin Ala Tyr 
595 600 605 

TTC GTG TCC CTC TTC AAC CGC TTT GAC TGC TTC GTC GTG TGT GGC GGC 1872 
Phe Val Ser Leu Phe Asn Arg Phe Asp Cys Phe Val Val Cys Gly Gly 
610 615 620 

ATC CTG GAG ACC ATC CTG GTG GAG ACC AAG ATC ATG TCC CCA CTG GGC 1920 
lie Leu Glu Thr lie Leu Val Glu thr Lys lie Met Ser Pro Leu Gly 
625 630 635 640 

ATC TCC GTG CTC AGA TGC GTC CGG CTG CTG AGG ATT TTC AAG ATC ACG 1968 
lie Ser Val Leu Arg Cys Val Arg Leu Leu Arg lie Phe Lys lie Thr 
645 650 655 

AGG TAC TGG AAC TCC TT6 A6C AAC CTG GTG GCA TCC TTG CTG AAC TCT 2016 
Arg Tyr Trp Asn Ser Leu Ser Asn Leu Val Ala Ser Leu Leu Asn Ser 
660 665 670 

GTG CGC TCC ATC GCC TCC CTG CTC CTT CTC CTC TTC CTC TTC ATC ATC 2064 
Val Arg Ser lie Ala Ser lieu Leu Leu Leu Leu Phe Leu Phe lie lie 
675 680 685 

ATC TTC TCC CTC CTG GGG ATG CAG CTC TTT 6QA GGA AAG TTC AAC TTT 2112 
lie Phe Ser Leu Leu Gly Met Gin Leu Phe Gly Gly Lys Phe Asn Phe 
690 695 700 

GAT GAG ATG CAG ACC CGG AGG AGC ACA TTC GAT AAC TTC CCC CAG TCC 2160 
Asp Glu Met Gin Thr Arg Arg Ser Thr Phe Asp Asn Phe Pro Gin Ser 
705 710 715 720 

CTC CTC ACT GTG TTT CAG ATC CTG ACC GGG GAG GAC TGG AAT TCG GTG 2208 
Leu Leu Thr Val Phe Gin lie Leu Thr Gly Glu Asp Trp Asn Ser Val 
725 730 735 

ATG TAT GAT GGG ATC ATG GCT TAT GGG GGC CCC TCT TTT CCA GGG ATG 2256 
Met Tyr Asp Gly lie Met Ala Tyr Gly Gly Pro Ser Phe Pro Gly Met 
740 745 750 

TTA GTC TGT ATT TAC TTC ATC ATC CTC TTC ATC TGT GGA AAC TAT ATC 2304 
Leu Val Cys lie Tyr Phe He He Leu Phe He Cys Gly Asn Tyr He 
755 760 765 

CTA CTG AAT GTG TTC TTG GCC ATT GCT GTG GAC AAC CTG GCT GAT GCT 2352 
Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala Asp Ala 
770 775 780 

GAG AGC CTC ACA TCT GCC CAA AAG GAG GAG GAA GAG GAG AAG GAG AGA 2400 
Glu Ser Leu Thr Ser Ala Gin Lys Glu Glu Glu Glu Glu Lys Glu Arg 
785 790 795 800 

AAG AAG CTG GCC AGG ACT GCC AGC CCA GAG AAG AAA CAA GAG TTG GTG 2446 
Lys Lys Leu Ala Arg Thr Ala Ser Pro Glu Lys Lys Gin Glu Leu Val 
805 810 815 

GAG AAG CCG GCA GTG GGG GAA TCC AAG GAG GAG AAG ATT GAG CTG AAA 2496 
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Glu Lys Pro Ala Val Gly Glu Ser Lys Glu Glu Lys lie Glu Leu Lys 
820 825 830 

TCC ATC ACG GCT GAC GGA GAG TCT CCA CCC GCC ACC AAG ATC AAC ATG 2544 
Ser lie Thr Ala Asp Gly Glu Ser Pro Pro Ala Thr Lys lie Asn Met 
835 840 845 

GAT GAC CTC GAG CCC AAT GAA AAT GAG GAT AAG AGC CCC TAC CCC AAC 2592 
Asp Asp Leu Gin Pro Asn Glu Asn Glu Asp Lys Ser Pro Tyr Pro Asn 
850 855 8e0 

CCA GAA ACT ACA GGA GAA GAG GAT GAG GAS GAG CCA GAG ATG CCT GTC 2640 
Pro Glu Thr Thr Gly Glu Glu Asp Glu Glu Glu Pro Glu Met Pro Val 
865 870 875 880 

GGC CCT CGC CCA CGA CCA CTC TCT GAG CTT CAC CTT AAG GAA AAG GCA 2688 
Gly Pro Arg Pro Arg Pro Leu Ser Glu Leu Bis Leu Lys Glu Lys Ala 
885 890 895 

GTG CCC ATG CCA GAA GCC AGC QCG TTT TTC ATC TTC AGC TCT AAC AAC 2736 
Val Pro Met Pro Glu Ala Ser Ala Phe Phe lie Phe Ser Ser Asn Asn 
900 905 910 

AGG TTT CGC CTC CAG TQC CAC CGC ATT GTC AAT GAC ACG ATC TTC ACC 2784 
Arg Phe Arg Leu Gin Cys His Arg lie Val Asn Asp Thr lie Phe Thr 
915 920 925 

AAC CTG ATC CTC TTC TTC ATT CTG CTC AGC AGC ATT TCC CTG GCT GCT 2832 
Asn Leu lie Leu Phe Phe lie Leu Leu Ser Ser lie Ser Leu Ala Ala 
930 935 940 

GAG GAC CCG GTC CAG CAC ACC TCC TTC AGG AAC CAT ATT CTG TTT TAT 2880 
Glu Asp Pro Val Gin His Thr Ser Phe Arg Asn His lie Leu Phe Tyr 
945 950 955 960 

TTT GAT ATT GTT TTT ACC ACC ATT TTC ACC ATT GAA ATT GCT CTG AAG 2928 
Phe Asp lie Val Phe Thr Thr lie Phe Thr lie Glu lie Ala Leu Lys 
965 970 975 

ATG ACT GCT TAT GGG GCT TTC TTG CAC AAG GGT TCT TTC TGC CGG AAC 2976 
Met Thr Ala Tyr Gly Ala Phe Leu His Lys Gly Ser Phe Cys Arg Asn 
980 985 990 

TAC TTC AAC ATC CTG GAC CTG CTG GTG GTC AGC GTG TCC CTC ATC TCC 3024 
Tyr Phe Asn lie Leu Asp lieu Leu Val Val Ser Val Ser Leu lie Ser 
995 1000 1005 

TTT GGC ATC CAG TCC AGT GCA ATC AAT GTC GTG AAG ATC TTG CGA GTC 3072 
Phe Gly lie Gin Ser Ser Ala lie Asn Val Val Lys lie Leu Arg Val 
1010 1015 1020 

CTG CGA GTA CTC AGG CCC CTG AGG GCC ATC AAC AGG GCC AAG GGG CTA 3120 
Leu Arg Val Leu Arg Pro Leu Arg Ala lie Asn Arg Ala Lys Gly Leu 
1025 1030 1035 1040 

AAG CAT GTG GTT CAG TGT GTG TTT GTC GCC ATC CGG ACC ATC GGG AAC 3168 
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Lys His Val Val Gin Cys Val Phe Val Ala lie Arg Thr lie Gly Asn 
1045 1050 1055 

ATC GTG ATT GTC ACC ACC CTG CTG CAG TTC ATG TTT GCC TGC ATC GGG 3216 
lie Val He Val Thr Thr Leu Leu Gin Phe Met Phe Ala Cys He Gly 
1060 1065 1070 

GTC CAG CTC TTC AAG GGA AAG CTG TAC ACC TGT TCA GAC AGT TCC AAG 3264 
Val Gin Leu Phe Lys Gly Lys IjCU Tyr Thr Cys Ser Asp Ser Ser Lys 
1075 1080 1085 

CAG ACA GAG GCG GAA TGC AAG GGC AAC TAC ATC ACG TAC AAA GAC GGG 3312 
Gin Thr Glu Ala Qlu Cys Lys Gly Asn Tyr He Thr Tyr Lys Asp Gly 
1090 1095 1100 

GAG GTT GAC CAC CCC ATC ATC CAA CCC CGC AGC TGG GAG AAC AGC AAG 3360 
Glu Val Asp His Pro He He Gin Pro Arg Ser Trp Glu Asn Ser Lys 
1105 1110 1115 1120 

TTT GAC TTT GAC AAT GTT CTG GCA GCC ATG ATG GCC CTC TTC ACC GTC 340B 
Phe Asp Phe Asp Asn Val Leu Ala Ala Net Net Ala Leu Phe Thr Val 
1125 1130 1135 

TCC ACC TTC GAA GGG TGG CCA GAG CTG CTG TAC CGC TCC ATC GAC TCC 3456 
Ser Thr Phe Glu Gly Trp Pro Glu Leu Leu Tyr Arg Ser He Asp Ser 
1140 1145 1150 

CAC ACG GAA GAC AAG GGC CCC ATC TAC AAC TAC CGT GTG GAG ATC TCC 3504 
His Thr Glu Asp Lys Gly Pro He Tyr Asn Tyr Arg Val Glu He Ser 
1155 1160 1165 

ATC TTC TTC ATC ATC TAC ATC ATC ATC ATC GCC TTC TTC ATG ATG AAC 3552 
He Phe Phe He He Tyr He He He He Ala Phe Phe Net Net Asn 
1170 1175 1180 

ATC TTC GTG GGC TTC GTC ATC GTC ACC TTT CAG GAG CAG GGG GAG CAG 3600 
He Phe Val Gly Phe Val He Val Thr Phe Gin Glu Gin Gly Glu Gin 
1185 1190 1195 1200 

GAG TAC AAG AAC TGT GAG CTG GAC AAG AAC CAG CGA CAG TGC GTG GAA 3648 
Glu Tyr Lys Asn Cys Glu I^eu Asp Lys Asn Gin Arg Gin Cys Val Glu 
1205 1210 1215 

TAC GCC CTC AAG GCC CGG CCC CTG CGG AGG TAC ATC CCC AAG AAC CAG 3696 
Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr He Pro Lys Asn Gin 
1220 1225 1230 

CAC CAG TAC AAA GTG TGG TAC GTG GTC AAC TCC ACC TAC TTC GAG TAC 3744 
His Gin Tyr Lys Val Trp Tyr Val Val Asn Ser Thr Tyr Phe Glu Tyr 
1235 1240 1245 

CTG ATG TTC GTC CTC ATC CTG CTC AAC ACC ATC TGC CTG GCC ATG CAG 3792 
Leu Met Phe Val Leu He lieu Leu Asn Thr He Cys Leu Ala Net Gin 
1250 1255 1260 

CAC TAC GGC CAG AGC TGC CTG TTC AAA ATC GCC ATG AAC ATC CTC AAC 3840 
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His Tyr Gly Gin Ser Cys Leu Phe Lys lie Ala Met Asn lie Leu Asn 
1265 1270 X275 1280 

ATG CTC TTC ACT GGC CTC TTC ACC GTG GAG ATG ATC CTG AAG CTC ATT 3888 
Met Leu Phe Thr Gly Leu Phe Thr Val Glu Met lie Leu Lys Leu He 
1285 1290 1235 

GCC TTC AAA CCC AAG GGT TAC TTT AGT GAT CCC TGG AAT GTT TTT GAC 3936 
Ala Phe Lys Pro Lys Gly Tyr Phe Ser Asp Pro Txp Asn Val Phe Asp 
1300 1305 1310 

TTC CTC ATC GTA ATT GGC AGC ATA ATT GAC GTC ATT CTC AGT GAG ACT 3984 
Phe Leu He Val He Gly Ser He He Asp Val He Leu Ser Glu Thr 
1315 1320 1325 

AAT CCA GCT GAA CAT ACC CAA TGC TCT CCC TCT ATG AAC GCA GAG QAA 4032 
Asn Pro Ala Glu His Thr Gin Cys Ser Pro Ser Met Asn Ala Glu Glu 
1330 1335 1340 

AAC TCC CGC ATC TCC ATC ACC TTC TTC CGC CTG TTC CGG GTC ATG COT 4080 
Asn Ser Arg He Ser He Thr Phe Phe Arg Leu Phe Arg Val Met Arg 
1345 1350 1355 1360 

CTG GTG AAG CTG CTG AGC CGT GGG GAG GGC ATC CGG ACG CTG CTG TGG 4128 
Leu Val Lys Leu Leu Ser Arg Gly Glu Gly He Arg Thr Leu Leu Trp 
1365 1370 1375 

ACC TTC ATC AAG TCC TTC CAG GCC CTG CCC TAT GTG GCC CTC CTG ATC 4176 
Thr Phe He Lys Ser Phe Gin Ala Leu Pro Tyr Val Ala Leu Leu He 
1380 1385 1390 

GTG ATG CTG TTC TTC ATC TAC GCG GTG ATC GGG ATG CAG GTG TTT GGG 4224 
Val Met Leu Phe Phe He Tyr Ala Val He Gly Met Gin Val Phe Gly 
1395 1400 1405 

AAA ATT GCC CTG AAT GAT ACC ACA GAG ATC AAC CGG AAC AAC AAC TTT 4272 
Lys He Ala Leu Asn Asp Thr Thr Glu He Asn Arg Asn Asn Asn Phe 
1410 1415 1420 

CAG ACC TTC CCC CAG GCC GTG CTG CTC CTC TTC AOG TGT GCC ACC GGG 4320 
Gin Thr Phe Pro Gin Ala Val Leu Leu Leu Phe Arg Cys Ala Thr Gly 
1425 1430 1435 1440 

GAG GCC TGG CAG GAC ATC ATG CTG GCC TGC ATG CCA GGC AAG AAG TGT 4368 
Glu Ala Trp Gin Asp He Met Leu Ala Cys Met Pro Gly Lys Lys Cys 
1445 1450 1455 

GCC CCA GAG TCC GAG CCC AGC AAC AGC ACG GAG GGT GAA ACA CCC TGT 4416 
Ala Pro Glu Ser Glu Pro Ser Asn Ser Thr Glu Gly Glu Thr Pro Cys 
1460 1465 1470 

GGT AGC AGC TTT GCT GTC TTC TAC TTC ATC AGC TTC TAC ATG CTC TGT 4464 
Gly Ser Ser Phe Ala Val Phe Tyr Phe He Ser Phe Tyr Met Leu Cys 
1475 1480 1485 

GCC TTC CTG ATC ATC AAC CTC TTT GTA GCT GTC ATC ATG GAC AAC TTT 4512 
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Ala Phe Leu He He Asn Leu Phe Val Ala Val He Met Asp Asn Phe 
1490 1495 1500 

GAC TAG CTG ACA AGG GAC TGG TCC ATC CTT GGT CCC CAC CAC CTG GAT 4560 
Asp Tyr Leu Thr Arg Asp Trp Ser He Leu Gly Pro His His Leu Asp 
1505 1510 1515 1520 

GAG TTT AAA AGA ATC TGG GCA GA6 TAT GAC CCT GAA GCC AAG GGT CGT 4608 
Glu Phe Lys Arg He Trp Ala Glu Tyr Asp Pro Glu Ala Lys Gly Arg 
1525 1530 1535 

ATC AAA CAC CTG GAT GTG GTG ACC CTC CTC CGG CGG ATT CAG CCG CCA 4656 
He Lys His Leu Asp Val Val Thr Leu Leu Arg Ar^ He Gin Pro Pro 
1540 1545 1550 

CTA GGT TTT GGG AAG CTG TGC CCT CAC CGC GTG GCT TGC AAA CGC CTG 4704 
Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala Cys Lys Arg Leu 
1555 1560 1565 

GTC TCC ATG AAC ATG CCT CTG AAC AGC GAC GGG ACA GTC ATG TTC AAT 4752 
Val Ser Met Asn Met Pro Leu Asn Ser Asp Gly Thr Val Met Phe Asn 
1570 1575 1580 

GCC ACC CTG TTT GCC CTG GTC AGG ACG GCC CTG AGG ATC AAA ACA GAA 4800 
Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Arg He Lys Thr Glu 
1585 1590 1595 1600 

GGG AAC CTA GAA CAA GCC AAT GAG GAG CTG CGG GCG ATC ATC AAG AAG 4848 
Gly Asn Leu Glu Gin Ala Asn Glu Glu Leu Arg Ala He He Lys Lys 
1605 1610 1615 

ATC TGG AAG CGG ACC AGC ATG AAG CTG CTG GAC CAG GTG GTG CCC CCT 4896 
He Trp Lys Arg Thr Ser Met Lys Leu Leu Asp Gin Val Val Pro Pro 
1620 1625 1630 

GCA GGT GAT GAT GAG GTC ACC GTT GGC AAG TTC TAG GCC ACG TTC CTG 4944 
Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr Ala Thr Phe Leu 
1635 1640 1645 

ATC CAG GAG TAC TTC CGG AAG TTC AAG AAG CGC AAA GAG CAG GGC CTT 4992 
He Gin Glu Tyr Phe Arg Lys Phe Lys Lys Arg Lys Glu Gin Gly Leu 
1650 1655 1660 

GTG GGC AAG CCC TCC CAG AGG AAC GCG CTG TCT CTG CAG GCT GGC TTG 5040 
Val Gly Lys Pro Ser Gin Arg Asn Ala Leu Ser Leu Gin Ala Gly lieu 
1665 1670 1675 1680 

CGC ACA CTG CAT GAC ATC GGG CCT GAG ATC CGA CGG GCC ATC TCT GGA 5088 
Arg Thr Leu His Asp He Gly Pro Glu He Arg Arg Ala He Ser Gly 
1685 1690 1695 



GAT CTC ACC GCT GAG GAG GAG CTG GAC AAG GCC ATG AAG GAG GCT GTG 
Asp Leu Thr Ala Glu Glu Glu Leu Asp Lys Ala Met: Lys Glu Ala Val 
1700 1705 1710 



5136 



TCC GCT GCT TCT GAA GAT GAC ATC TTC AGG AGG GCC GGT GGC CTG TTC 



5184 
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Ser Ala Ala Ser Glu Asp Asp He Phe Arg Arg Ala Gly Gly Leu Phe 
1715 1720 1725 



GGC AAC CAC 
Gly Asn His 
1730 


GTC AGC TAC TAC CAA 
Val Ser Tyr Tyr Gin 
1735 


AGC GAC 
Ser Asp 


GGC 
Gly 


CGG AGC 
Arg Ser 
1740 


GCC 
Ala 


TTC 

Phe 


CCC 
Pro 


5232 


CA6 ACC 
Gin Thr 
1745 


TTC 
Phe 


ACC ACT CAG CGC 
Thr Thr Gin Arg 
1750 


CCG 
Pro 


CTG CAC 
Leu His 


ATC AAC 
He Asn 
1755 


AAG 
Lys 


GCG 
Ala 


GGC AGC 
Gly Ser 
1760 


5280 


AGC 
Ser 


CAG 
Gin 


GGC 
Gly 


GAC ACT GAG 
Asp Thr Glu 
1765 


TCG 
Ser 


CCA 
Pro 


TCC CAC GAG 
Ser His Glu 
1770 


AAG 
Lys 


CTG 
Leu 


GTG 

Val 


GAC TCC 
Asp Ser 
1775 


5328 


ACC 
Thr 


TTC 
Phe 


ACC 
Thr 


CCG AGC 
Pro Ser 
1780 


AGC 
Ser 


TAC 
Tyr 


TCG 
Ser 


TCC ACC 
Ser Thr 
1785 


GGC 
Gly 


TCC 
Ser 


AAC 
Asn 


GCC AAC 
Ala Asn 
1790 


ATC 
He 


5376 


AAC 
Asn 


AAC 
Asn 


GCC AAC 
Ala Asn 
1795 


AAC 
Asn 


ACC 
Thr 


GCC 
Ala 


CTG GGT 
Leu Gly 
1800 


CGC 
Arg 


CTC 
Leu 


CCT 
Pro 


CGC CCC 
Arg Pro 
1805 


GCC 
Ala 


GGC 
Gly 


5424 


TAG 
Tyr 


CCC AGC 
Pro Ser 
1810 


ACA 
Thr 


GTC 
Val 


AGC 
Ser 


ACT GTG 
Thr Val 
1815 


GAG 
Glu 


GGC 
Gly 


CAC 
His 


GGG CCC 
Gly Pro 
1820 


CCC 
Pro 


TTG 
Leu 


TCC 
Ser 


5472 


CCT GCC 

Pro Ala 

1825 


ATC 
He 


CGG 
Arg 


GTG 
Val 


CAG GAG 
Gin Glu 
1B30 


GTG 
Val 


GC6 
Ala 


TGG 
Trp 


AAG CTC 
Lys Leu 
1835 


AGC 
Ser 


TCC 
Ser 


AAC 
Asn 


AGG 
Arg 
1840 


5520 


TGC 
Cys 


CAC 
His 


TCC 
Ser 


CGG 
Arg 


GAG AGC 
Glu Ser 
1845 


CAG 
Gin 


GCA 
Ala 


GCC 
Ala 


ATG GCG 
Net Ala 
1850 


CGT 
Arg 


CAG 
Gin 


GAG 
Glu 


GAG ACG 
Glu Thr 
1855 


5568 


TCT 
Ser 


CAG 
Gin 


GAT 
Asp 


GAG ACC 
Glu Thr 
1860 


TAT 
Tyr 


GAA 
Glu 


GTG 
Val 


AAG ATG 
Lys Het 
1865 


AAC 
Asn 


CAT 
His 


GAC 
Asp 


ACG GAG 
Thr Glu 
1870 


GCC 
Ala 


5616 


TGC 
Cys 


AGT 
Ser 


GAG CCC 
Glu Pro 
1875 


AGC 
Ser 


CTG 
Leu 


CTC 
Leu 


TCC ACA 
Ser Thr 
1880 


GAG 
Glu 


ATG 
Met 


CTC 
Leu 


TCC TAC 
Ser Tyr 
1885 


CAG 
Gin 


GAT 
Asp 


5664 


GAC 
Asp 


GAA AAT 
Glu Asn 
1890 


CGG 
Arg 


CAA 

Gin 


CTG 
Leu 


ACG CTC 
Thr Leu 
1895 


CCA 
Pro 


GAG 
Glu 


GAG 
Glu 


GAC AAG 
Asp Lys 
1900 


AGG 
Axg 


GAC 
Asp 


ATC 

He 


5712 


CGG CAA TCT 
Arg Gin Ser 
1905 


CCG 
Pro 


AAG 
Lys 


AGG GGT TTC 
Arg Gly Phe 
1910 


CTC 
Leu 


CGC 
Arg 


TCT GCC 
Ser Ala 
1915 


TCA 
Ser 


CTA 
Leu 


GGT 

Gly 


CGA 
Arg 
1920 


5760 


AGG GCC TCC 
Arg Ala Ser 


TTC 

Phe 


CAC CTG GAA TGT 
His Leu Glu Cys 
1925 


CTG 
Leu 


AAG CGA CAG AAG 
Lys Arg Gin Lys 
1930 


GAC 
Asp 


CGA GGG 
Arg Gly 
1935 


5808 


GGA 


GAC 


ATC 


TCT 


CAG 


AAG 


ACA 


GTC 


CTG 


CCC 


TTG 


CAT 


CTG 


GTT 


CAT 


CAT 


5856 
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Gly Asp lie Ser 6ln Lys Thr Val Leu Pro Leu His Leu Val His His 
1940 1945 1950 

CAG 6CA TTG GCA GTG GCA GGC CTG A6C CCC CTC CTC CAG AGA AGC CAT 5904 
Gin Ala Leu Ala Val Ala Gly Leu Ser Pro Leu Leu Gin Arg Ser His 
1955 1960 1965 

TCC CCT GCC TCA TTC CCT AGG CCT TTT GCC ACC CCA CCA GCC ACA CCT 5952 
Ser Pro Ala Ser Phe Pro Arg Pro Phe Ala Thr Pro Pro Ala Thr Pro 
1970 1975 1980 

GGC AGC CGA GGC TGG CCC CCA CAG CCC 6TC CCC ACC CTG CGG CTT GAG 6000 
Gly Ser Arg Gly Trp Pro Pro Gin Pro Val Pro Thr Leu Arg Leu Glu 
1985 1990 1995 2000 

GGG GTC GAG TCC AGT GAG AAA CTC AAC AGC AGC TTC CCA TCC ATC CAC 6048 
Gly Val Glu Ser Ser Glu Lys Leu Asn Ser Ser Phe Pro Ser lie His 
2005 2010 2015 

TGC GGC TCC TGG GCT GAG ACC ACC CCC GGT GGC GGG GGC AGC AGC GCC 6096 
Cys Gly Ser Trp Ala Glu Thr Thr Pro Gly Gly Gly Gly Ser Ser Ala 
2020 2025 2030 

GCC CGG AGA GTC CGG CCC GTC TCC CTC ATG GTG CCC AGC CAG GCT GGG 6144 
Ala Arg Arg Val Arg Pro Val Ser Leu Met Val Pro Ser Gin Ala Gly 
2035 2040 2045 

GCC CCA GGG AGG CAG TTC CAC GGC AGT GCC AGC AGC CTG GTG GAA GCG 6192 
Ala Pro Gly Arg Gin Phe His Gly Ser Ala Ser Ser Leu Val Glu Ala 
2050 2055 2060 

GTC TTG ATT TCA GAA GGA CTG GGG CAG TTT GCT CAA GAT CCC AAG TTC 6240 
Val Leu lie Ser Glu Gly Leu Gly Gin Phe Ala Gin Asp Pro Lys Phe 

2070 2075 2080 2085 

ATC GAG GTC ACC ACC CAG GAG CTG GCC GAC GCC TGC GAC ATG ACC ATA 6288 
lie Glu Val Thr Thr Gin Glu Leu Ala Asp Ala Cys Asp Met Thr He 
2090 2095 2100 

GAG GAG ATG GAG AGC GCG GCC GAC AAC ATC CTC AGC GGG GGC GCC CCA 6336 
Glu Glu Met Glu Ser Ala Ala Asp Asn He Leu Ser Gly Gly Ala Pro 
2105 2110 2115 

CAG AGC CCC AAT GGC GCC CTC TTA CCC TTT GTG AAC TGC AGG GAC GCG 6384 
Gin Ser Pro Asn Gly Ala Leu l«eu Pro Phe Val Asn Cys Arg -Asp Ala 
2120 2125 2130 

GGG CAG GAC CGA GCC GGG GGC GAA GAG GAC GCG GGC TGT GTG CGC GCG 6432 
Gly Gin Asp Arg Ala Gly Gly Glu Glu Asp Ala Gly Cys Val Arg Ala 
2135 2135 2140 

CGG GGT CGA CCG AGT GAG GAG GAG CTC CAG GAC AGC AGG GTC TAC GTC 6480 
Arg Gly Arg Pro Ser Glu Glu Glu Leu Gin Asp Ser Arg Val Tyr Val 
2145 2150 2155 2160 



Tvnr* Jkfzr* cvfs. TunTnruznfin Tfinnikni^Tfir* w^nrTTTTTT T*raTTT/5*rTT f aaTHTTr'nT 
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Ser Ser Leu 

AATGGGTTCG TTTCAGAAGT GCCTCACTGT TCTCGT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AGACCACGGC TTCCTCO/UIT CTTGOGCGAA GCCGCCGGCCA TCGGAGGAG GGATTAATCC 
AGACCC6CCG GGGGGTGTTT TCACATTTCT TCCTCTTCGTG GCTGCTCCT CCTATTAA3UI 
CCATTTTTGG TCC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 hsiSB pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGCTGAGGGC CTTCCGCGTG CTGCGCCCCC TGCGGCTGQT GTCCGGAGTC CCAAGTCTCC 
AGGTGGTCCT GAATTCCATC ATCAAGGCC 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .84 

(D) OTHER INFORMATION: /note= "7^ alternative exon of 
alpha-lC." 

(xi) SEQX3ENCE DESCRIPTION: SEQ ID NO: 6: 
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CAC TAT TTC TGT GAT GCA TGG AAT ACA TTT GAC GCC TTG ATT 6TT GT6 
His Tyr Phe Cys Asp Ala Tzp Asn Thr Phe Asp Ala Leu He Val Val 
15 10 15 



48 



GGT AGC ATT 6TT GAT ATA GCA ATC ACC GAG 6TA AAC 
Gly Ser He Val Asp He Ala He Thr Glu Val Asn 
20 25 



84 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7362 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECOLE TYPE: DNA (genomic) 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 144.. 7163 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..143 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATIC»i: 7161*. 7362 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



6CGGCGGCGG CTGCGGCGGT GGGGCCGGGC GAGGTCC6TG C6GTCCCGGC GGCTCCGTGG 60 

CTGCTCCGCT CTGAGCGCCT GC6CGCCCCG CGCCCTCCCT GCCGGGGCCG CTGOGCCQGG 120 

GAT6CACGCG GGGCCCGGGA GCC ATG GTC CGC TTC GGG GAC GAG CTG GGC 170 

Met Val Arg Phe Gly Asp Glu Leu Gly 



GGC 
Gly 
10 


CGC 
Arg 


TAT 
Tyr 


GGA 
Gly 


GGC 

Gly 


CCC 
Pro 
15 


GGC 
Gly 


1 

GGC 
Gly 


GGA 

Gly 


GAG 
Glu 


CGG 
Arg 
20 


5 

GCC 
Ala 


CGG 
Arg 


GGC 
Gly 


GGC 
Gly 


GGG 
Gly 
25 


218 


GCC 
Ala 


GGC 
Gly 


GGG 
Gly 


GCG 
Ala 


GGG 
Gly 
30 


GGC 
Gly 


CCG 
Pro 


GGT 
Gly 


CCC 
Pro 


GGG 
Gly 
35 


GGG 
Gly 


CTG 
Leu 


CAG 
Gin 


CCC 
Pro 


GGC 
Gly 
40 


CAG 
Gin 


266 


CGG 
Arg 


GTC 

Val 


CTC 

Leu 


TAC 
Tyr 
45 


AAG 

Lys 


CAA 
Gin 


TCG 
Ser 


ATC 
He 


GCG 
Ala 
50 


CAG 
Gin 


CGC 
Arg 


GCG 
Ala 


CGG 
Arg 


ACC 
Thr 
55 


ATG 
Met 


GCG 
Ala 


314 


CTG 
Leu 


TAC 
Tyr 


AAC 
Asn 
60 


CCC 
Pro 


ATC 
He 


CCG 
Pro 


GTC 
Val 


AAG 
Lys 
65 


CAG 
Gin 


AAC 
Asn 


TGC 
Cys 


TTC 
Phe 


ACC 
Thr 
70 


GTC 
Val 


AAC 
Asn 


CGC 
Arg 


362 
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TCG CTC TTC GTC TTC AGC GAG GAC AAC GTC GTC CGC AAA TAG GCG AAG 410 
Ser Leu Phe Val Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys 
75 80 85 

CGC ATC ACC GAG TGG CCT CCA TTC GAG AAT ATG ATC CTG GCC ACC ATC 458 
Arg He Thr Glu Trp Pro Pro Phe Glu Asn Met He Leu Ala Thr He 
90 95 100 105 

ATC GCC AAC TGC ATC GTG CTG GCC CTG GAG CAG CAC CTC CCT GAT GGG 506 
He Ala Asn Cys He Val Leu Ala Leu Glu Gin His Leu Pro Asp Gly 
110 115 120 

QAC AAA ACQ CCC ATG TCC GAG CG6 CTG GAC GAC ACG GAG CCC TAT TTC 554 
Asp Lys Thr Pro Net Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe 
125 130 135 

ATC GGG ATC TTT TGC TTC GAG GCA GGG ATC AAA ATC ATC GCT CTG GGC 602 
He Gly He Phe Cys Phe Glu Ala Gly He Lys He He Ala Leu Gly 
140 145 150 

TTT GTC TTC CAC AAG GGC TCT TAC CTG CGG AAC GGC TGG AAC GTC ATG 650 
Phe Val Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met 
155 160 165 

GAC TTC GTG GTC GTC CTC ACA GGG ATC CTT GCC ACG GCT GGA ACT GAC 698 
Asp Phe Val Val Val Leu Thr Gly He Leu Ala Thr Ala Gly Thr Asp 
170 175 180 185 

TTC GAC CTG CGA ACA CTG AGG GCT GTG CGT GTG CTG AGG CCC CTG AAG 746 
Phe Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys 
190 195 200 

CTG GTG TCT GGG ATT CCA AGT TTG CAG GTG GTG CTC AAG TCC ATC ATG 794 
Leu Val Ser Gly He Pro Ser Leu Gin Val Val Leu Lys Ser He Met 
205 210 215 

AAG GCC ATG GTT CCA CTC CTG CAG ATT GGG CTG CTT CTC TTC TTT GCC 842 
Lys Ala Met Val Pro Leu Leu Gin He Gly Leu Leu Leu Phe Phe Ala 
220 225 230 

ATC CTC ATG TTT GCC ATC ATT GGC CTG GAG TTC TAC ATG GGC AAG TTC 890 
He Leu Net Phe Ala He He Gly Leu Glu Phe Tyr Net Gly Lys Phe 
235 240 245 

CAC AAG GCC TGT TTC CCC AAC AGC ACA GAT GCG GAG CCC GTG GGT GAC 938 
His Lys Ala Cys Phe Pro Asn Ser Thr Asp Ala Glu Pro Val Gly Asp 
250 255 260 265 

TTC CCC TGT GGC AAG GAG GCC CCA GCC CGG CTG TGC GAG GGC GAC ACT 986 
Phe Pro Cys Gly Lys Glu Ala Pro Ala Arg Leu cys Glu Gly Asp Thr 
270 275 280 

GAG TGC CGG GAG TAC TGG CCA GGA CCC AAC TTT GGC ATC ACC AAC TTT 1034 
Glu Cys Arg Glu Tyr Trp Pro Gly Pro Asn Phe Gly He Thr Asn Phe 
285 290 295 
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GAC AAT ATC CTG TTT GCC ATC TTG ACG GTG TTC CAG TGC ATC ACC ATG 1082 
Asp Asn lie Leu Phe Ala lie Leu Hut Val Phe Gin Cys He Thr Met 
300 305 310 

GAG GGC TGG ACT GAG ATC CTC TAT AAT ACA AAC GAT GCG GCC GGC AAC 1130 
Glu Gly Trp Thr Asp He Leu Tyr Asn Thr Asn Asp Ala Ala Gly Asn 

315 320 325 

ACC TGG AAC TGG CTC TAC TTC ATC CCT CTC ATC ATC ATC GGC TCC TTC 1178 
Thr Trp Asn Trp Leu Tyr Phe He Pro Leu He He He Gly Ser Phe 
330 335 340 345 

TTC ATG CTC AAC CTG GTG CTG GGC GTG CTC TCG GGG GAG TTT GCC AAG 1226 
Phe Met Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys 
350 355 360 

GAG CGA GAG AGG GTG GAG AAC CGC CGC GCC TTC CTG AAG CTG CGC CGG 1274 
Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg 
365 370 375 

CAG CAG CAG ATC GAG CGA GAG CTC AAC GGG TAC CTG GAG TGG ATC TTC 1322 
Gin Gin Gin He Glu Arg Glu Leu Asn Gly Tyr Leu Glu Trp He Phe 
380 385 390 

AAG GCG GAG GAA GTC ATG CTG GCC GAG GAG GAG AGG AAT GCA GAG GAG 1370 
Lys Ala Glu Glu Val Net Leu Ala Glu Glu Asp Arg Asn Ala Glu Glu 
395 400 405 

AAG TCC CCT TTG GAC GTG CTG AAG AGA GCG GCC ACC AAG AAG AGC AGA 1418 
Lys Ser Pro Leu Asp Val Leu Lys Arg Ala Ala Thr Lys Lys Ser Arg 
410 415 420 425 

AAT GAC CTG ATC CAC GCA GAG GAG GGA G2^ GAC CGG TTT GCA GAT CTC 1466 
Asn Asp Leu He His Ala Glu Glu Gly Glu Asp Arg Phe Ala Asp Leu 
430 435 440 

TGT GCT GTT GGA TCC CCC TTC GCC CGC GCC AGC CTC AAG AGC GGG AAG 1514 
Cys Ala Val Gly Ser Pro Phe Ala Arg Ala Ser Leu Lys Ser Gly Lys 
445 450 455 

ACA GAG AGC TCG TCA TAC TTC CGG AGG AAG GAG AAG ATG TTC CGG TTT 1562 
Thr Glu Ser Ser Ser Tyr Phe Arg Arg Lys Glu Lys Met Phe Arg Phe 
460 465 470 

TTT ATC CGG CGC ATG GTG AAG GCT CAG AGC TTC TAC TGG GTG GTG CTG 1610 
Phe lie Arg Arg Met Val Lys Ala Gin Ser Phe Tyr Trp Val Val Leu 
475 480 485 

TGC GTG GTG GCC CTG AAC ACA CTG TGT GTG GCC ATG GTG CAT TAC AAC 1658 
Cys Val Val Ala Leu Asn Thr Leu Cys Val Ala Met Val His Tyr Asn 
490 495 500 505 

CAG CCG CGG CGG CTT ACC ACG ACC CTG TAT TTT GCA GAG TTT GTT TTC 1706 
Gin Pro Arg Arg Leu Thr Thr Thr Leu Tyr Phe Ala Glu Phe Val Phe 
510 515 520 
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CTG GGT CTC TTC CTC ACA GAG ATG TCC CTG AA6 ATG TAT GGC CTG G6G 1754 
lieu 61y Leu Phe Leu Thr Qlu Met Ser Leu Lys Met Tyr Gly Leu Gly 
525 530 535 

CCC AGA AGC TAG TTC CX3G TCC TCC TTC AAC TGC TTC GAC TTT GG6 GTC 1802 
Pro Arg Ser Tyr Phe Arg Ser Ser Phe Asn Cys Phe Asp Phe Gly Val 
540 545 550 

ATC GTG GGG AGC GTC TTT GAA GTG GTC TGG GCG 6CC ATC AAS CCG GGA 1850 
He Val Gly Ser Val Phe Glu Val Val Trp Ala Ala He Lys Pro Gly 
555 560 565 

AGC TCC TTT GGG ATC AGT GTG CTG CGG GCC CTC CGC CTG CTG AGG ATC 1898 
Ser Ser Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arg He 
570 575 580 585 

TTC AAA GTC ACG AAG TAC TGG AGC TCC CTG CGG AAC CTG GTG GTG TCC 1946 
Phe Lys Val Thr Lys Tyr Txp Ser Ser Leu Arg Asn Leu Val Val Ser 
590 595 600 

CTG CTG AAC TCC ATG AAG TCC ATC ATC AGC CTG CTC TTC TTG CTC TTC 1994 
Leu Leu Asn Ser Met Lys Ser He He Ser Leu Leu Phe Leu Leu Phe 
605 610 615 

CTG TTC ATT GTG GTC TTC GCC CTG CTG GGG ATG CAG CTG TTT GGG GGA 2042 
Leu Phe He Val Val Phe Ala Leu Leu Gly Net Gin Leu Phe Gly Gly 
620 625 630 

CAG TTC AAC TTC CAG GAT GAG ACT CCC ACA ACC AAC TTC GAC ACC TTC 2090 
Gin Phe Asn Phe Gin Asp Glu Thr Pro Thr Thr Asn Phe Asp Thr Phe 
635 640 645 

CCT GCC GCC ATC CTC ACT GTC TTC CAG ATC CTG ACG GGA GAG GAC TGG 2138 
Pro Ala Ala He Leu Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp 
650 ^ 655 660 665 

AAT GCA GTG ATG TAT CAC GGG ATC GAA TCG CAA GGC GGC GTC AGC AAA 2186 
Asn Ala Val Met Tyr His Gly He Glu Ser Gin Gly Gly Val Ser Lys 
670 675 680 

GGC ATG TTC TCG TCC TTT TAC TTC ATT GTC CTG ACA CTG TTC GGA AAC 2234 
Gly Met Phe Ser Ser Phe Tyr Phe He Val Leu Thr Leu Phe Gly Asn 
685 690 695 

TAC ACT CTG CTG AAT GTC TTT CTG GCC ATC GCT GTG GAC AAC CTG GCC 2282 
Tyr Thr Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala 
700 705 710 

AAC GCC CAA GAG CTG ACC AAG GAT GAA GAG GAG ATG GAA GAA GCA GCC 2330 
Asn Ala Gin Glu Leu Thr Lys Asp Glu Glu Glu Met Glu Glu Ala Ala 
715 720 725 

AAT CAG AAG CTT GCT CTG CAA AAG GCC AAA GAA GTG GCT GAA GTC AGC 2378 
Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu Val Ser 
730 735 740 745 
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CCC ATG TCT GCC GCG AAC ATC TCC ATC GCC GCC AGG CAG CAG AAC TCG 2426 
Pro Met Ser Ala Ala Asn He Ser He Ala Ala Arg Gin Gin Asn Ser 
750 755 760 

GCC AAG GCG CGC TCG GTG TGG GAG CAG CGG GCC AGC CAG CTA CGG CTG 2474 
Ala Lys Ala Arg Ser Val Trp Glu Gin Arg Ala Ser Gin Leu Arg Leu 
765 770 775 

CAG AAC CTG CGG GCC AGC TGC GAG GCG CTG TAC AGC GAG ATG GAC CCC 2522 
Gin Asn Leu Arg Ala Ser Cys Glu Ala Leu Tyr Ser Glu Met Asp Pro 
780 785 790 

GAG GAG CGG CTG CGC TTC GCC ACT ACG CGC CAC CTG CGG CCC GAC ATG 2570 
Glu Glu A3:g Leu Arg Phe Ala Thr Thr Arg His Leu Arg Pro Asp Met 
795 800 805 

AAG ACG CAC CTG GAC CGG CCG CTG GTG GTG GAG CTG GGC CGC GAC GGC 2618 
Lys Thr His Leu Asp Arg Pro Leu Val Val Glu Leu Gly Arg Asp Gly 
810 815 820 825 

GCG CGG GGG CCC GTG GGA GGC AAA GCC CGA CCT GAG GCT GCG GAG GCC 2666 
Ala Arg Gly Pro Val Gly Gly Lys Ala Arg Pro Glu Ala Ala Glu Ala 
830 835 840 

CCC GAG GGC GTC GAC CCT CCG CGC AGG CAC CAC CGG CAC CGC GAC AAG 2714 
Pro Glu Gly Val Asp Pro Pro Arg Arg His His Arg His Arg Asp Lys 
845 850 855 

GAC AAG ACC CCC GCG GCG GGG GAC CAG GAC CGA GCA GAG GCC CCG AAG 2762 
Asp Lys Thr Pro Ala Ala Gly Asp Gin Asp Arg Ala Glu Ala Pro Lys 
860 865 870 

GCG GAG AGC GGG GAG CCC GGT GCC CGG GAG GAG CGG CCG CGG CCG CAC 2810 
Ala Glu Ser Gly Glu Pro Gly Ala Arg Glu Glu Arg Pro Arg Pro His 
875 880 885 

CGC AGC CAC AGC AAG GAG GCC GCG GGG CCC CCG GAG GCG CGG AGC GAG 2858 
Arg Ser His Ser Lys Glu Ala Ala Gly Pro Pro Glu Ala Arg Ser Glu 
890 895 900 905 

CGC GGC CGA GGC CCA GGC CCC GAG GGC GGC CGG CGG CAC CAC CGG CGC 2906 
Arg Gly Arg Gly Pro Gly Pro Glu Gly Gly Arg Arg His His Arg Arg 
910 915 920 

GGC TCC CCG GAG GAG GCG GCC GAG CGG GAG CCC CGA CGC CAC CGC GCG 2954 
Gly Ser Pro Glu Glu Ala Ala Glu Arg Glu Pro Arg Arg His Arg Ala 
925 930 935 

CAC CGG CAC CAG GAT CCG AGC AAG GAG TGC GCC GGC GCC AAG GGC GAG 3002 
His Arg His Gin Asp Pro Ser Lys Glu Cys Ala Gly Ala Lys Gly Glu 
940 945 950 

CGG CGC GCG CGG CAC CGC GGC GGC CCC CGA GCG GGG CCC CGG GAO GCG 3050 
Arg Arg Ala Arg His Arg Gly Gly Pro Arg Ala Gly Pro Arg Glu Ala 
955 960 965 
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OAG AGC GGG GAG GAG CCG GC6 CGG CGG CAC CGG GCC CG6 CAC AAG GCG 3098 
Qlu Ser Gly Glu Qlu Pro Ala Arg Arg His Arg Ala Arg His Lys Ala 
970 975 980 985 

CAG CCT GCT CAC GAG GCT GTG GAG AAG GAG ACC ACG GAG AAG GAG GCC 3146 
Gin Pro Ala His Glu Ala Val Glu Lys Glu Thr Thr Glu Lys Glu Ala 
990 995 iooo 

ACG GAG AAG GAG GCT GAG ATA GTG GAA GCC GAC AAG GAA AAG GAG CTC 3194 
Thr Glu Lys Glu Ala Glu lie Val Glu Ala Asp Lys Glu Lys Glu Leu 
1005 1010 1015 

CGG AAC CAC CAG CCC CGG GAG CCA CAC TGT GAC CTG GAG ACC AGT GGG 3242 
Arg Asn His Qln Pro Arg Glu Pro His Cys Asp Leu Glu Thr Ser Gly 
1020 1025 1030 

ACT GTG ACT GTG GGT CCC ATG CAC ACA CTG CCC AGC ACC TGT CTC CAG 3290 
Thr Val Thr Val Gly Pro Met His Thr Leu Pro Ser Thr Cys Leu Gin 
1035 1040 1045 

AAG GTG GAG GAA CAG CCA GAG GAT GCA GAC AAT CAG CGG AAC GTC ACT 3338 
Lys Val Glu Glu Gin Pro Glu Asp Ala Asp Asn Qln Arg Asn Val Thr 
1050 1055 1060 1065 

CGC ATG GGC AGT CAG CCC CCA GAC CCG AAC ACT ATT GTA CAT ATC CCA 3386 
Arg Met Gly Ser Gin Pro Pro Asp Pro Asn Thr lie Val His He Pro 
1070 1075 1080 

GTG ATG CTG ACG GGC CCT CTT GGG GAA GCC ACG GTC GTT CCC AGT GGT 3434 
Val Met Leu Thr Gly Pro Leu Gly Glu Ala Thr Val Val Pro Ser Gly 
1085 1090 1095 

AAC GTG GAC CTG GAA AGC CAA GCA GAG GGG AAG AAG GAG GTG GAA GCG 3482 
Asn Val Asp Leu Glu Ser Gin Ala Glu Gly Lys Lys Glu Val Glu Ala 
1100 1105 1110 

GAT GAC GTG ATG AGG AGC GGC CCC CGG CCT ATC GTC CCA TAC AGC TCC 3530 
Asp Asp Val Met Arg Ser Gly Pro Arg Pro He Val Pro Tyr Ser Ser 
1115 1120 1125 

ATG TTC TGT TTA AGC CCC ACC AAC CTG CTC CGC CGC TTC TGC CAC TAC 3578 
Met Phe Cys Leu Ser Pro Thr Asn Leu Xieu Arg Arg Phe Cys His Tyr 
1130 1135 1140 1145 

ATC GTG ACC ATG AGG TAC TTC GAG GTG GTC ATT CTC GTG GTC ATC GCC 3626 
He Val Thr Met Arg Tyr Phe Glu Val Val He Leu Val Val He Ala 
1150 1155 1160 

TTG AGC AGC ATC GCC CTG GCT GCT GAG GAC CCA GTG CGC ACA GAC TCG 3674 
Leu Ser Ser He Ala Leu Ala Ala Glu Asp Pro Val Arg Thr Asp Ser 
1165 1170 1175 

CCC AGG AAC AAC GCT CTG AAA TAC CTG GAT TAC ATT TTC ACT GGT GTC 3722 
Pro Arg Asn Asn Ala Leu Lys Tyr Leu Asp Tyr He Phe Thr Gly Val 
1180 1185 1190 
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TTT ACC TTT GAG ATG GTG ATA AAG ATG ATC GAC TTG GGA CTG CTG CTT 3770 
Phe Thr Phe Glu Net Val lie Lys Met lie Asp Leu Gly Leu Leu Leu 
1195 1200 1205 

CAC CCT GGA GCC TAT TTC CGG GAC TTG TGG AAC ATT CTG GAC TTC ATT 3818 
His Pro Gly Ala Tyr Phe Arg Asp Leu Trp Asn lie Leu Asp Phe lie 
1210 1215 1220 1225 

GTG GTC AGT GGC GCC CTG GTG GCG TTT GCT TTC TCA GGA TCC AAA GGG 3866 
Val Val Ser Gly Ala Leu Val Ala Phe Ala Phe Ser Gly Ser Lys Gly 
1230 1235 1240 

AAA GAC ATC AAT ACC ATC AAG TCT CTG AGA GTC CTT CGT GTC CTG CGG 3914 
Lys Asp He Asn Thr He Lys Ser Leu Arg Val Leu Arg Val Leu Arg 
1245 1250 1255 

CCC CTC AAG ACC ATC AAA CGG CTG CCC AAG CTC AAG GCT GTG TTT GAC 3962 
Pro Leu Lys Thr He Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp 
1260 1265 1270 

TGT GTG GTG AAC TCC CTG AAG AAT GTC CTC AAC ATC TTG ATT GTC TAC 4010 
Cys Val Val Asn Ser Leu Lys Asn Val Leu Asn He Leu He Val Tyr 
1275 1280 1285 

ATG CTC TTC ATG TTC ATA TTT GCC GTC ATT GCG GTG GAG CTC TTC AAA 4058 
Met Leu Phe Met Phe He Phe Ala Val He Ala Val Gin Leu Phe Lys 
1290 1295 1300 1305 

GGG AAG TTT TTC TAC TGC ACA GAT GTIA TCC AAG GAG CTG GAG AGG GAC 4106 
Gly Lys Phe Phe Tyr Cys Thr Asp Glu Ser Lys Glu Leu Glu Arg Asp 
1310 1315 1320 

TGC AGG GGT CAG TAT TTG GAT TAT GAG AAG GAG GAA GTG GAA GCT CA6 4154 
Cys Arg Gly Gin Tyr Leu Asp Tyr Glu Lys Glu Glu Val Glu Ala Gin 
1325 1330 1335 

CCC AGG CAG TGG AAG AAA TAC GAC TTT CAC TAC GAC AAT GTG CTC TGG 4202 
Pxx> Arg Gin Trp Lys Lys Tyr Asp Phe His Tyr Asp Asn Val Leu Trp 
1340 1345 1350 

GCT CTG CTG ACG CTG TTC ACA GTG TCC ACG GGA GAA GGC TGG CCC ATG 4250 
Ala Leu Leu Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Met 
1355 1360 1365 

GTG CTG AAA CAC TCC GTG GAT GCC ACC TAT GAG GAG CAG GGT CCA AGC 4298 
Val Leu Lys His Ser Val Asp Ala Thr Tyr Glu Glu Gin Gly Pro Ser 
1370 1375 1380 1385 

CCT GGG TAC CGC ATG GAG CTG TCC ATC TTC TAC GTG GTC TAC TTT GTG 4346 
Pro Gly Tyr Arg Met Glu Leu Ser He Phe Tyr Val Val Tyr Phe Val 
1390 1395 1400 

GTC TTT CCC TTC TTC TTC GTC AAC ATC TTT GTG GCT TTG ATC ATC ATC 4394 
Val Phe Pro Phe Phe Phe Val Asn He Phe Val Ala Leu He He He 
1405 1410 1415 
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ACC TTC CAG GAG CAG GGG GAC AAG GTG ATG TCT GAA TGC AGC CTG GAG 4442 
Thr Phe Gin Glu Gin Gly Asp Lys Val Met Ser Glu Cys Ser Leu Glu 
1420 1425 1430 

AAG AAC GAG AGG GCT TGC ATT GAC TTC GCC ATC AGC GCC AAA CCC CTG 4490 
Lys Asn Glu Arg Ala Cys He Asp Phe Ala He Ser Ala Lys Pro Leu 
1435 1440 1445 

ACA CGG TAC ATG CCC CAA AAC CGG CAG TCG TTC CAG TAT AAG AOG TGG 4538 
Thr Arg Tyr Met Pro Gin Asn Arg Gin Ser Phe Gin Tyr Lys Thr Trp 
1450 1455 1460 1465 

ACA TTT GTG GTC TCC CCG CCC TTT GAA TAC TTC ATC ATG GCC ATG ATA 4586 
Thr Phe Val Val Ser Pro Pro Phe Glu Tyr Phe He Met Ala Met He 
1470 1475 1480 

GCC CTC AAC ACT GTG GTG CTG ATG ATG AAG TTC TAT GAT GCA CCC TAT 4634 
Ala Leu Asn Thr Val Val Leu Met Met Lys Phe Tyr Asp Ala Pro Tyr 
1485 1490 1495 

GAG TAC GAG CTG ATG CTG AAA TGC CTG AAC ATC GTG TTC ACA TCC ATG 4682 
Glu Tyr Glu Leu Met Leu Lys Cys Leu Asn He Val Phe Thr Ser Met 
1500 1505 1510 

TTC TCC ATG GAA TGC GTG CTG AAG ATC ATC GCC TTT GGG GTG CTG AAC 4730 
Phe Ser Met Glu Cys Val Leu Lys He He Ala Phe Gly Val Leu Asn 
1515 1520 1525 

TAT TTC AGA GAT GCC TGG AAT GTC TTT GAC TTT GTC ACT GTG TTG GGA 4778 
Tyr Phe Arg Asp Ala Trp Asn Val Phe Asp Phe Val Thr Val Leu Gly 
1530 1535 1540 1545 

AGT ATT ACT GAT ATT TTA GTA ACA GAG ATT GCG GAA ACG AAC AAT TTC 4826 
Ser He Thr Asp He Leu Val Thr Glu He Ala Glu Thr Asn Asn Phe 
1550 1555 1560 

ATC AAC CTC AGC TTC CTC CGC CTC TTT CGA GCT GCG CGG CTG ATC AAG 4874 
He Asn Leu Ser Phe Leu Arg Leu Phe Arg Ala Ala Arg Leu He Lys 
1565 1570 1575 

CTG CTC CGC CAG GGC TAC ACC ATC CGC ATC CTG CTG TGG ACC TTT GTC 4922 
Leu Leu Arg Gin Gly Tyr Thr He Arg He Leu Leu Trp Thr Phe Val 
1580 1585 1590 

CAG TCC TTC AAG GCC CTG CCC TAC GTG TGT CTG CTC ATT GCC ATG CTG 4970 
Gin Ser Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu He Ala Met Leu 
1595 1600 1605 

TTC TTC ATC TAC GCC ATC ATC GGC ATG CAG GTG TTT GGG AAT ATT GCC 5018 
Phe Phe He Tyr Ala He He Gly Met Gin Val Phe Gly Asn He Ala 
1610 1615 1620 1625 

CTG GAT GAT GAC ACC AGC ATC AAC CGC CAC AAC AAC TTC CGG ACG TTT 5066 
Leu Asp Asp Asp Thr Ser He Asn Arg His Asn Asn Phe Arg Thr Phe 
1630 1635 1640 
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TTG CAA GCC CTG ATG CTG CTG TTC AGG AGC GCC ACG GGG GAG GCC TGG 5114 
Leu Gin Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Tzp 
1645 1650 1655 

CAC GAG ATC ATG CTG TCC TGC CTG AGC AAC CAG GCC TGT GAT GAG CAG 5162 
His Qlu lie Met Leu Ser Cys Leu Ser Asn Gin Ala Cys Asp Glu Gin 
1660 1665 1670 

GCC AAT GCC ACC GAG TGT GGA AGT GAC TTT GCC TAC TTC TAC TTC GTC 5210 
Ala Asn Ala Tbr Glu Cys Gly Ser Asp Phe Ala Tyr Phe Tyr Phe Val 
1675 1680 1685 

TCC TTC ATC TTC CTG TGC TCC TTT CTG ATG TTG AAC CTC TTT GTG GCT 5258 
Ser Phe lie Phe Leu Cys Ser Phe Leu Met Leu Asn Leu Phe Val Ala 
1690 1695 1700 1705 

GTG ATC ATG GAC AAT TTT GAG TAC CTC ACG CGG GAC TCT TCC ATC CTA 5306 
Val lie Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser lie Leu 
1710 1715 1720 

GGT CCT CAC CAC TTG GAT GAG TTC ATC CGG GTC TGG GCT GAA TAC GAC 5354 
Gly Pro His His Leu Asp Glu Phe lie Arg Val Trp Ala Glu Tyr Asp 
1725 1730 1735 

CCG GCT GCQ TGT GGG CGC ATC AGT TAC AAT GAC ATG TTT GAG ATG CTG 5402 
Pro Ala Ala Cys Gly Arg lie Ser Tyr Asn^Asp Met Phe Glu Met Leu 
1740 1745 1750 

AAA CAC ATG TCC CCG CCT CTG GGG CTG GGG AAG AAA TGC CCT GCT CGA 5450 
Lys His Met Ser Pro Pro Leu Gly Leu Gly Lys Lys Cys Pro Ala Arg 
1755 1760 1765 

GTT GCT TAC AAG CGC CTG GTT CGC ATG AAC ATG CCC ATC TCC AAC GAG 5498 
Val Ala Tyr Lys Arg Leu Val Arg Met Asn Met Pro lie Ser Asn Glu 
1770 1775 1780 1785 

GAC ATG ACT GTT CAC TTC ACG TCC ACG CTG ATG GCC CTC ATC CGG ACG 5546 
Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala Leu He Arg Thr 
1790 1795 1800 

GCA CTG GAG ATC AAG CTG GCC CCA GCT GGG ACA AAG CAG CAT CAG TGT 5594 
Ala Leu Glu He Lys Leu Ala Pro Ala Gly Thr Lys Gin His Gin Cys 
1805 1810 1815 

GAC GCG GAG TTG AGG AAG GAG ATT TCC GTT GTG TGG GCC AAT CTG CCC 5642 
ABp Ala Glu Leu Arg Lys Glu He Ser Val Val Trp Ala Asn Leu Pro 
1820 1825 1830 

CAG AAG ACT TTG GAC TTG CTG GTA CCA CCC CAT AAG CCT GAT GAG ATG 5690 
Gin Lys Thr Leu Asp Leu Leu Val Pro Pro His Lys Pro Asp Glu Met 
1835 1840 1845 

ACA GTG GGG AAG GTT TAT GCA OCT CTG ATG ATA TTT GAC TTC TAC AAG 5738 
Thr Val Gly Lys Val Tyr Ala Ala Leu Met He Phe Asp Phe Tyr Lys 
1850 1855 1860 1865 
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CAG AAC AAA ACC ACC AGA GAC CAG ATG CAG CAG GCT CCT GGA GGC CTC 5786 
Gin Asn Lys Thr Thr Arg Asp Gin Met Gin Gin Ala Pro Gly Gly Leu 
1870 1875 1880 

TCC CAG ATG GGT CCT GTG TCC CTG ITC CAC CCT CTG AAG GCC ACC CTG 5834 
Ser Gin Met Gly Pro Val Ser Leu Phe His Pro Leu Lys Ala Thr Leu 
1885 1890 1895 

GAG CAG ACA CAG CCG GCT GTG CTC CGA GGA GCC CGG GTT TTC CTT CGA 5882 
Glu Gin Thr Gin Pro Ala Val Leu Arg Qly Ala Arg Val Phe Leu Arg 
1900 1905 1910 

CAG AAG AGT TCC ACC TCC CTC AGC AAT GGC GGG GCC ATA CAA AAC CAA 5930 
Gin Lys Ser Ser Thr Ser Leu Ser Asn Gly Gly Ala He Gin Asn Gin 
1915 1920 . 1925 

GAG ACT GGC ATC AAA GAG TCT GTC TCC TGG GGC ACT CAA AGG ACC CAG 5978 
Glu Ser Gly He Lys Glu Ser Val Ser Trp Gly Thr Gin Arg Thr Gin 
1^30 1935 1940 1945 

GAT OCA CCC CAT GAG GCC AGG CCA CCC CTG GAG COT GGC CAC TCC ACA 6026 
Asp Ala Pro His Glu Ala Arg Pro Pro Leu Glu Arg Gly His Ser Thr 
1950 1955 I960 

GAG ATC CCT GTG GGG CGG TCA GGA GCA CTG GCT GTG GAC GTT CAG ATG 6074 
Glu He Pro Val Gly Arg Ser Gly Ala Leu Ala Val Asp Val Gin Met 
1965 1970 1975 

CAG AGC ATA ACC CGG AGG GGC CCT GAT GGG GAG CCC CAG CCT GGG CTG 6122 
Gin Ser He Thr Arg Arg Gly Pro Asp Gly Glu Pro Gin Pro Gly Leu 
1980 1985 1990 

GAG AGC CAG GGT CGA GCG GCC TCC ATG CCC CGC CTT GCG GCC GAG ACT 6170 
Glu Ser Gin Gly Arg Ala Ala Ser Met Pro Arg Leu Ala Ala Glu Thr 
1995 2000 2005 

CAG CCC GTC ACA GAT GCC AGC CCC ATG AAG CGC TCC ATC TCC ACG CTG 6218 
Gin Pro Val Thr Asp Ala Ser Pro Met Lys Arg Ser He Ser Thr Leu 
2010 2015 2020 2025 

GCC CAG CGG CCC CGT GGG ACT CAT CTT TGC AGC ACC ACC CCG GAC CGC 6266 
Ala Gin Arg Pro Arg Gly Thr His Leu Cys Ser Thr Thr Pro Asp Arg 
2030 2035 2040 

CCA CCC CCT AGC CAG GCG TCG TCG CAC CAC CAC CAC CAC CGC TGC CAC 6314 
Pro Pro Pro Ser Gin Ala Ser Ser His His His His His Arg Cys His 
2045 2050 2055 

CGC CGC AGG GAC AGG AAG CAG AGG TCC CTG GAG AAG GGG CCC AGC CTG 6362 
Arg 7^ Arg Asp Arg Lys Gin Arg Ser Leu Glu Lys Gly Pro Ser Leu 
2060 2065 2070 

TCT GCC GAT ATG GAT GGC GCA CCA AGC AGT GCT GTG GGG CCG GGG CTG 6410 
Ser Ala Asp Met Asp Gly Ala Pro Ser Ser Ala Val Gly Pro Gly Leu 
2075 2080 2085 
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CCC CCG GGA GAQ GGG CCT ACA GGC TGC CGG CGG GAA CGA GAG CGC CGG 
Pro Pro Gly Glu Gly Pro Thr Gly Cys Arg Arg Glu Arg Glu Arg Arg 
2090 2095 2100 2105 

CaW3 GAG CGG GGC CGG TCC CAG GAG CGG AGG CAG CCC TCA TCC TCC TCC 
Gin Glu Arg Gly Arg Ser Gin Glu Arg Arg Gin Pro Ser Ser Ser Ser 
2110 2115 2120 

TCG GAG AAG CAG CGC TTC TAC TCC TGC GAC CGC TTT GGG GGC C6T GAG 
Ser Glu Lys Gin Arg Phe Tyr Ser Cys Asp Arg Phe Gly Gly Arg Glu 
2125 2130 2135 

CCC CCG AAG CCC AAG CCC TCC CTC A6C AGC CAC CCA ACO TCG CCA ACA 
Pro Pro Lys Pro Lys Pro Ser Leu Ser Ser His Pro Thr Ser Pro Thr 
2140 2145 2150 

GCT GGC CAG GAG CCG GGA CCC CAC CCA CAG GGC A6T GGT TCC GTO AAT 
Ala Gly Gin Glu Pro Gly Pro His Pro Gin Gly Ser Gly Ser Val Asn 
2155 21€0 2165 



6458 



6506 



6554 



6602 



6650 



GGG AGC CCC TTG CTG TCA ACA TCT GGT GCT AGC ACC CCC GGC CGC GGT 
Gly Ser Pro Leu Leu Ser Thr Ser Gly Ala Ser Thr Pro Gly Arg Gly 
2170 2175 2180 2185 



GGG CGG AGG CAG CTC CCC CAG ACG CCC CTG ACT CCC CGC CCC AGC ATC 
Gly Arg Arg Gin Leu Pro Gin Thr Pro Leu Thr Pro Arg Pro Ser He 
2190 2195 2200 

ACC TAC AAG ACG GCC AAC TCC TCA CCC ATC CAC TTC GCC GGG GCT CAG 
Thr Tyr Lys Thr Ala Asn Ser Ser Pro He His Phe Ala Gly Ala Gin 
2205 2210 2215 

ACC AGC CTC CCT GCC TTC TCC CCA GGC CGG CTC AGC CGT GGG CTT TCC 
Thr Ser Leu Pro Ala Phe Ser Pro Gly Arg Leu Ser Arg Gly Leu Ser 
2220 2225 2230 

GAA CAC AAC GCC CTG CTG CAG AGA GAC CCC CTC AGC CAG CCC GTG GCC 
Glu His Asn Ala Leu Leu Gin Arg Asp Pro Leu Ser Gin Pro Leu Ala 
2235 2240 2245 

CCT GGC TCT CGA ATT GGC TCT GAC CCT TAC CTG GGG CAG CGT CTG GAC 
Pro Gly Ser Arg He Gly Ser Asp Pro Tyr Leu Gly Gin Arg Leu Asp 
2250 2255 2260 2265 

AGT GAG GCC TCT GTC CAC GCC CTG CCT GAG GAC ACG CTC ACT TTC GAG 
Ser Glu Ala Ser Val His Ala Leu Pro Glu Asp Thr Leu Thr Phe Glu 
2270 2275 2280 

GAG GCT GTG GCC ACC AAC TCG GGC CGC TCC TCC AGG ACT TCC TAC GTG 
Glu Ala Val Ala Thr Asn Ser Gly Arg Ser Ser Arg Thr Ser Tyr Val 
2285 2290 2295 

TCC TCC CTG ACC TCC CAG TCT CAC CCT CTC CGC CGC GTG CCC AAC GGT 
Ser Ser Leu Thr Ser Gin Ser His Pro Leu Arg Arg Val Pro Asn Gly 
2300 2305 2310 



6698 



6746 



6794 



6842 



6890 



6938 



6986 



7034 



7082 
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TAC CAC TGC ACC CTG GGA CTC AGC TCG GGT GGC CGA GCA CGG CAC AGC 7130 
Tyr His Cys Thr Leu Gly Leu Ser Ser Gly Gly Arg Ala Tkrg His Ser 

2315 2320 2325 

TAC CAC CAC CCT GAC CAA GAC CAC TGG TGC TAGCTGCACC GTCACCGCTC 7180 
Tyr His His Pro Asp Gin Asp His Trp Cys 
2330 2335 234 

AGACGCCT6C ATGCAGCAGQ CGTGTGTTCC AGTGGATGAG TTTTATCATC CACACGQGGC 7240 
AGTCGGCCCT CGGGGGAGGC CTTGCCCACC TTGGTGAGGC TCCTQTGGCC CCTCCCTCCC 7300 
CCTCCTCCCC TCTTTTACTC TAGACGACGA ATAAAGCCCT GTTGCTTGAG TGTACGTACC 7360 

7362 

(2) INFORMATION FOR SBQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7175 base pairs 

(B) TypE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLBCDLE TYPE: DNA (genomic) 

(ix) FEATDRE: 

(A) NAME/KEY: CDS 

(B) LOCATICni: 144.. 6857 

(ix) FEATDRE: 

(A) NAME/KEY: 5'DTR 

(B) LOCATION: 1. .143 

(ix) FEATDRE: 

(A) NAME/KEY: 3'DTR 

(B) IX)CATION: 6855. .7175 

(xi) SE(}DENCE DESCRIPTION: SEQ ID NO: 8: 

GCGGCGGCGG CTGCGGCGGT GGGGCOGGGC GAGGTCCGTG CGGTCCCGGC GGCTCC6TGG 60 

CTGCTCCGCT CTGAGCOCCT GCGCQCCCCG CGCCCTCCCT 6CCGGGGCCG CTGGGCCGGG 120 

GATGCAC6C6 GGGCCCGGGA GCC ATG GTC CGC TTC GGG GAC GAG CTG GGC 170 

Met Val Arg Phe Gly Asp Glu Leu Gly • 
1 5 

GGC CQC TAT GGA GGC CCC GGC GGC GGA GAG CGG GCC CGG GGC GGC GGG 218 
Gly Arg Tyr Gly Gly Pro Gly Gly Gly Glu Arg Ala Arg Gly Gly Gly 
10 15 20 25 

GCC GGC GGG GCG GGG GGC CCG GGT CCC GGG GGG CTG CAG CCC GGC CAG 266 
Ala Gly Gly Ala Gly Gly Pro Gly Pro Gly Gly Leu Gin Pro Gly Gin 
30 35 40 
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CGG GTC CTC TAG AAG CAA TCG ATC GCG CAG CGC GCG CGG ACC ATG GCG 314 
Arg Val Leu Tyr Lys Gin Ser He Ala Gin Arg Ala Arg Thr Met Ala 
45 50 55 

CTG TAG AAG GCG ATC GCG GTG AAG CAG AAG TGC TTC ACC GTC AAC OGC 362 
Leu Tyr Asn Pro lie Pro Val Lys Gin Asn Cys Phe Thr Val Asn Arg 
60 65 70 

TCG CTC TTC GTC TTC AGC GAS GAG AAC GTC GTC CGC AAA TAG GCG AAG 410 
Ser Leu Phe Val Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys 
75 80 85 

CGC ATC ACC GAG TGG CCT CCA TTC GAG AAT ATG ATC CTG GCC ACC ATC 458 
Arg He Thr Glu Trp Pro Pro Phe Glu Asn Met He Leu Ala Thr He 
90 95 100 105 

ATC GCC AAC TGC ATC GTG CTG GCC CTG GAG CAG GAG CTC CCT GAT G6G 506 
He Ala Asn Cys He Val Leu Ala Leu Glu Gin His Leu Pro Asp Gly 
110 115 120 

GAG AAA AGG CGG ATG TCG GAG CGG CTG GAG GAG ACG GAG GCC TAT TTC 554 
Asp Lys Thr Pro Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe 
125 130 135 

ATG GGG ATC TTT TGC TTC GAG GCA GGG ATC AAA ATC ATC GCT CTG 6GC 602 
He Gly He Phe Cys Phe Glu Ala Gly He Lys He He Ala Leu Gly 
140 145 150 

TTT GTC TTC CAG AAG GGG TCT TAG CTG CGG AAG GGG TGG AAC GTG ATG 650 
Phe Val Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met 
155 160 165 

GAG TTC GTG GTG GTC CTC ACA GGG ATC CTT GCC ACG GCT GGA ACT GAG 698 
Asp Phe Val Val Val Leu Thr Gly He Leu Ala Thr Ala Gly Thr Asp 
170 175 180 185 

TTC GAC GTG GGA ACA CTG AGG GCT GTG CGT GTG CTG AGG CGC CTG AAG 746 
Phe Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys 
190 195 200 

CTG GTG TCT GGG ATT CCA AGT TTG CAG GTG GTG CTC AAG TGC ATC ATG 794 
Leu Val Ser Gly He Pro Ser Leu Gin Val Val Leu Lys Ser He Net 
205 210 215 

AAG GCC ATG GTT CCA CTC CTG CAG ATT GGG CTG CTT CTC TTC TTT GCC 842 
Lys Ala Met Val Pro Leu Leu Gin He Gly Leu Leu Leu Phe Phe Ala 
220 225 230 

ATC CTC ATG TTT GCC ATC ATT GGC CTG GAG TTC TAG ATG GGC AAG TTC 890 
He Leu Met Phe Ala He He Gly Leu Glu Phe Tyr Met Gly Lys Phe 
235 240 245 

GAC AAG GCC TGT TTC GCC AAC AGC ACA GAT GCG GAG CCC GTG GGT GAC 938 
His Lys Ala Cys Phe Pro Asn Ser Thr Asp Ala Glu Pro Val Gly Asp 
250 255 260 265 
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TTC CCC TGT GGC AAG GAG GCC CCA GCC CGG CT6 TGC GAG GGC GAC ACT 9B6 
Phe Pro Cys Gly Lys Glu Ala Pro Ala Arg Leu Cys Glu Gly Asp Thr 
270 275 280 

GAG TGC CGG GAG TAC TGG CCA GQA CCC AAC TTT GGC ATC ACC AAC TTT 1034 
Glu Cys Arg Glu Tyr Trp Pro Gly Pro Asn Phe Gly lie Thr Asn Phe 
285 290 295 

GAC AAT ATC CTG TTT GCC ATC TTG ACG GTG TTC CAG TGC ATC ACC ATG 1082 
Asp Asn lie Leu Phe Ala lie Leu Thr Val Phe Gin Cys He Thr Met 
300 305 310 

GAG GGC TGG ACT GAC ATC CTC TAT AAT ACA AAC GAT GCG GCC GGC AAC 1130 
Glu Gly Trp Thr Asp He Leu Tyr Asn Thr Asn Asp Ala Ala Gly Asn 
315 320 325 

ACC TGG AAC TGG CTC TAC TTC ATC CCT CTC ATC ATC ATC GGC TCC TTC 1178 
Thr Trp . Asn Trp Leu Tyr Phe He Pro Leu He He He Gly Ser Phe 
330 335 340 345 

TTC ATG CTC AAC CTG GTG CTG GGC GTG CTC TCG GGG GAG TTT GCC AAG 1226 
Phe Net Leu Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys 
350 355 360 

GAG CGA GAG AGG GTG GAG AAC CGC CGC GCC TTC CTG AAG CTG CGC CGG 1274 
Glu Arg Glu Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg 
365 370 375 

CAG CAG CAG ATC GAG CGA GAG CTC AAC GGG TAC CTG GAG TGG ATC TTC 1322 
Gin Gin Gin He Glu Arg Glu Leu Asn Gly Tyr Leu Glu Trp He Phe 
380 385 390 

AAG GCG GAG GAA GTC ATG CTG GCC GAG GAG GAC AGG AAT GCA GAG GAG 1370 
Lys Ala Glu Glu Val Met Leu Ala Glu Glu Asp Arg Asn Ala Glu Glu 
395 400 405 

AAG TCC CCT TTG GAC GTG CTG AAG AGA GCG GCC ACC AAG AAG AGC AGA 1418 
Lys Ser Pro Leu Asp Val lieu Lys Arg Ala Ala Thr Lys Lys Ser Arg 
410 415 420 425 

AAT GAC CTG ATC CAC OCA GAG GAG GGA GAG GAC CGG TTT GCA GAT CTC 1466 
Asn Asp Leu He His Ala Glu Glu Gly Glu Asp Arg Phe Ala Asp Leu 
430 435 440 

TGT GCT GTT GGA TCC CCC TTC GCC CGC GCC AGC CTC AAG AGC GGG AAG 1514 
Cys Ala Val Gly Ser Pro Phe Ala Arg Ala Ser Leu Lys Ser Gly Lys 
445 450 455 

ACA GAG AGC TCG TCA TAC TTC CGG AGG AAG GAG AAG ATG TTC CGG TTT 1562 
Thr Glu Ser Ser Ser Tyr Phe Arg Arg Lys Glu Lys Met Phe Arg Phe 
460 465 470 

TTT ATC CGG CGC ATG GTG AAG GCT CAG AGC TTC TAC TGG GTG GTG CTG 1610 
Phe He Arg Arg Met Val Lys Ala Gin Ser Phe Tyr Trp Val Val Leu 
475 480 485 
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TGC 6TG 6TG GCC CTG AAC ACA CT6 T6T 6TG 6CC ATG GT6 CAT TAG AAC 1658 
Cys Val Val Ala Leu Asn Thr Leu Cys Val Ala Met Val His Tyr Asn 
490 495 500 505 

GAG CCG CXK3 CGG CTT ACC ACG ACC CTG TAT TTT GCA GAG TTT GTT TTC 1706 
Gin Pro Arg Arg Leu Thr Thr Thr Leu Tyr Phe Ala Glu Phe Val Phe 
510 515 520 

CTG GGT CTC TTC CTC ACA GAG ATG TCC CTG AAG ATG TAT GGC CTG GGG 1754 
Leu Gly Leu Phe Leu Thr Glu Met Ser Leu Lys Net Tyr Gly Leu Gly 
525 530 535 

CCC AGA AGC TAC TTC CGG TCC TCC TTC AAC TGC TTC GAC TTT GGG GTC 1802 
Pro Arg Ser Tyr Phe Arg Ser Ser Phe Asn Cys Phe Asp Phe Gly Val 
540 545 550 

ATC GTG GGG AGC GTC TTT GAA GTG GTC TGG GCG GCC ATC AAG CCG GGA 1850 
lie Val Gly Ser Val Phe Glu Val Val Trp Ala Ala lie Lys Pro Gly 
555 560 565 

AGC TCC TTT GGG ATC AGT GTG CTG CGG GCC CTC CGC CTG CTG A6G ATC 1898 
Ser Ser Phe Gly lie Ser Val Leu Arg Ala Leu Arg Leu Leu Arg lie 
570 575 580 585 

TTC AAA GTC ACG AAG TAC TGG AGC TCC CTG CGG AAC CTG GTG GTG TCC 1946 
Phe Lys Val Thr Lys Tyr Trp Ser Ser Leu Arg Asn Leu Val Val Ser 
590 595 600 

CTG CTG AAC TCC ATG AAG TCC ATC ATC AGC CTG CTC TTC TTG CTC TTC 1994 
Leu Leu Asn Ser Met Lys Ser lie lie Ser Leu Leu Phe Leu Leu Phe 
605 610 615 

CTG TTC ATT GTG GTC TTC GCC CTG CTG GGG ATG GAG CTG TTT GGG GGA 2042 
Leu Phe lie Val Val Phe Ala Leu Leu Gly Met Gin Leu Phe Gly Gly 
620 625 630 

GAG TTC AAC TTC GAG GAT GAG ACT CGC ACA ACC AAC TTG GAG ACC TTC 2090 
Gin Phe Asn Phe Gin Asp Glu Thr Pro Thr Tbr Asn Phe Asp Thr PJie 
635 640 645 

CCT GCC GGC ATC GTC ACT GTC TTC CAG ATC CTG ACG GGA GAG GAC TGG 2138 
Pro Ala Ala He Leu Thr Val Phe Gin He Leu Thr Gly Glu Asp Txp 
650 655 660 665 

AAT GCA GTG ATG TAT GAG GGG ATC GAA TGG CAA GGG GGC GTC AGC AAA 2186 
Asn Ala Val Met Tyr His Gly He Glu Ser Gin Gly Gly Val Ser Lys 
670 675 680 

GGC ATG TTC TGG TGG TTT TAC TTC ATT GTC CTG ACA CTG TTC GGA AAC 2234 
Gly Met Phe Ser Ser Phe Tyr Phe He Val Leu Thr Leu Phe Gly Asn 
685 690 695 

TAC ACT CTG CTG AAT GTC TTT CTG GGG ATC GGT GTG GAC AAG GTG GGC 2282 
Tyr Thr Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala 
700 705 710 
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AAC QCC CAA GAG CTG ACC AAG GAT GAA GAG GAG ATG GAA GAA GCA GCC 2330 
Asn Ala Gin Glu Leu Tlir Lys Asp Glu Glu Glu Met Glu Glu Ala Ala 

715 720 725 

AAT CAG AAG CTT GOT CTG CAA AAG GCC AAA GAA GTG GOT GAA GTC AGC 237B 
Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu Val Ser 
730 735 740 745 

CCC ATG TCT GCC GCG AAC ATC TCC ATC GCC GCC AGG CAG CAS AAC TCG 2426 
Pro Met Ser Ala Ala Asn He Ser He Ala Ala Arg Gin Gin Asn Ser 
750 755 760 

GCC AAG GCG CGC TCG GTG TGG GAG CAG CGG GCC AGC CAG CTA CGG CTG 2474 
Ala Lys Ala Arg Ser Val Tip Glu Gin Arg Ala Ser Gin Leu Arg Leu 
765 770 775 

CAG AAC CTG CGG GCC AGC TGC GAG GCG CTG TAG AGC GAG ATG GAC CCC 2522 
Gin Asn Leu Arg Ala Ser Cys Glu Ala Leu Tyr Ser Glu Met Asp Pro 
780 785 790 

GAG GAG CGG CTG CGC TTC GCC ACT ACG CGC CAC CTG CGG CCC GAC ATG 2570 
Glu Glu Arg Leu Arg Phe Ala Thr Thr Arg His Leu Arg Pro Asp Met 
795 800 805 

AAG ACG CAC CTG GAC CGG CCG CTG GTG GTG GAG CTG GGC CGC GAC GGC 2618 
Lys Thr His Leu Asp Arg Pro Leu Val Val Glu Leu Gly Arg Asp Glv 
810 815 820 ' 

GCG CGG GGG CCC GTG GGA GGC AAA GCC CGA CCT GAG GCT GCG GAG GCC 2666 
Ala Arg Gly Pro Val Gly Gly Lys Ala Arg Pro Glu Ala Ala Glu Ala 
830 835 840 

CCC GAG GGC GTC GAC CCT CCG CGC AGG CAC CAC CGG CAC CGC GAC AAG 2714 
Pro Glu Gly Val Asp Pro Pro Arg Arg His His Arg His Arg Asp Lys 
845 850 855 

GAC AAG ACC CCC GCG GCG GGG GAC CAG GAC CGA GCA GAG GCC CCG AAG 2762 
Asp Lys Thr Pro Ala Ala Gly Asp Gin Asp Arg Ala Glu Ala Pro Lys 
860 865 870 

GCG GAG AGC GGG GAG CCC GGT GCC COG GAG GAG CGG CCG CGG CCG CAC 2810 
Ala Glu Ser Gly Glu Pro Gly Ala Arg Glu Glu Arg Pro Arg Pro His 
875 880 885 

CGC AGC CAC AGC AAG GAG GCC GCG GGG CCC CCG GAG GCG CGG AGC GAG 2858 
Arg Ser His Ser Lys Glu Ala Ala Gly Pro Pro Glu Ala Arg Ser Glu 
890 895 900 905 

CGC GGC CGA GGC CCA GGC CCC GAG GGC GGC CGG CGG CAC CAC CGG CGC 2906 
Arg Gly Arg Gly Pro Gly Pro Glu Gly Gly Arg Arg His His Arg Arg 
910 915 920 

GGC TCC CCG GAG GAG GCG GCC GAG CGG GAG CCC CGA CGC CAC CGC GCG 2954 
Gly Ser Pro Glu Glu Ala Ala Glu Arg Glu Pro Arg Arg His Arg Ala 
925 930 935 
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CAC CGG CAC CAG GAT CCG AGC AAG GAG TGC GCC GGC GCC AAG GGC GAG 3002 
His Arg His Gin Asp Pro Ser Lys Glu Cys Ala Gly Ala Lys Gly Glu 
940 945 950 

CGG CGC GCG CGG CAC CGC GGC GGC CCC CQA GCG GGG CCC CGG GAG GCG 3050 
Arg Arg Ala Arg His Arg Gly Gly Pro Arg Ala Gly Pro Arg Glu Ala 
955 960 965 

GAG AGC GGG GAG GAG CCG GCG CGG CGG CAC COG GCC CGG CAC AAG GCG 3098 
Glu Ser Gly Glu Glu Pro Ala Arg Arg His Arg Ala Arg His Lys Ala 
970 975 980 985 

CAG CCT GCT CAC GAG GCT GTG GAG AAG GAG ACC AOG GAG AAG GAG GCC 3146 
Gin Pro Ala His Glu Ala Val Glu Lys Glu Thr Thr Glu Lys Glu Ala 
990 995 1000 

ACG GAG AAG GAG GCT GAG ATA GTG GAA GCC GAC AAG GAA AAG GAG CTC 3194 
Thr Glu Lys Glu Ala Glu He Val Glu Ala Asp Lys Glu Lys Glu Leu 
1005 1010 1015 

COG AAC CAC CAG CCC CGG GAG CCA CAC TGT GAC CTG GAG ACC AGT GGG 3242 
Arg Asn His Gin Pro Arg Glu Pro His Cys Asp Leu Glu Thr Ser Gly 
1020 1025 1030 

ACT GTG ACT GTG 6GT CCC ATG CAC ACA CTG CCC AGC ACC TOT CTC CAG 3290 
Thr Val Thr Val Gly Pro Met His Thr Leu Pro Ser Thr Cys Leu Gin 
1035 1040 1045 

AAG GTG GAG GAA CAG CCA GAG GAT GCA GAC AAT CAG CGG AAC GTC ACT 3338 
Lys Val Glu Glu Gin Pro Glu Asp Ala Asp Asn Gin Arg Asn Val Thr 
1050 1055 1060 1065 

CGC ATG GGC AGT CAG CCC CCA GAC CCG AAC ACT ATT GTA CAT ATC CCA 3386 
Arg Met Gly Ser Gin Pro Pro Asp Pro Asn Thr He Val His He Pro 
1070 1075 1080 

GTG ATG CTG KG GGC CCT CTT GGG GAA GCC ACG GTC GTT CCC AGT GGT 3434 
Val Met Leu Thr Gly Pro Leu Gly Glu Ala Thr Val Val Pro Ser Gly 
1085 1090 1095 

AAC GTG GAC CTG GAA AGC CAA GCA GAG GGG AAG AAG GAG GTG GAA GCG 3482 
Asn Val Asp Leu Glu Ser Gin Ala Glu Gly Lys Lys Glu Val Glu Ala 
1100 1105 1110 

GAT GAC GTG ATG AGG AGC GGC CCC CGG CCT ATC GTC CCA TAC AGC TCC 3530 
Asp Asp Val Met Arg Ser Gly Pro Arg Pro He Val Pro Tyr Ser Ser 
1115 1120 1125 

ATG TTC TGT TTA AGC CCC ACC AAC CTG CTC CGC CGC TTC TGC CAC TAC 3578 
Met Phe Cys Leu Ser Pro Thr Asn Leu Leu Arg Arg Phe Cys His Tyr 
1130 1135 1140 1145 

ATC GTG ACC ATG AGG TAC TTC GAG GTG GTC ATT CTC GTG GTC ATC GCC 3626 
He Val Thr Met Arg Tyr Phe Glu Val Val He Leu Val Val He Ala 
1150 1155 1160 
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TTG AGC AGC ATC GCC CTG GCT GCT GAG GAC CCA GTG CGC ACA GAC TCG 3674 
Leu Ser Ser He Ala Leu Ala Ala Glu Asp Pro Val Arg Thr Asp fier 
1165 1170 1175 

CCC AGG AAC AAC GCT CTG AAA TAC CTG GAT TAC ATT TTC ACT GGT GTC 3722 
Pro Arg Asn Asn Ala Leu Lys Tyr Leu Asp Tyr He Phe Thr Gly Val 
1180 1185 1190 

TTT ACC TTT GAG ATG GTG ATA AAG ATG ATC GAC TTG GGA CTG CTG CTT 3770 
Phe Thr Phe Glu Met Val He Lys Met He Asp Leu Gly Leu Leu Leu 
11S5 1200 1205 

CAC CCT GGA GCC TAT TTC CGG GAC TTG TGG AAC ATT CTG GAC TTC ATT 3818 
His Pro Gly Ala Tyr Phe Arg Asp Leu Trp Asn He Leu Asp Phe He 
"10 1215 1220 1225 

GTG GTC AGT GGC GCC CTG GTG OCG TTT GCT TTC TCA GGA TCC AAA GGG 3866 
Val Val Ser Gly Ala Leu Val Ala Phe Ala Phe Ser Gly Ser Lys Gly 
1230 1235 1240 

AAA GAC ATC AAT ACC ATC AAG TCT CTG A6A GTC CTT CGT GTC CTG CGG 3914 
Lys Asp He Asn Thr He Lys Ser Leu Arg Val Leu Arg Val Leu Arg 
1245 1250 1255 

CCC CTC AAG ACC ATC AAA COG CTG CCC AAG CTC AAG GCT GTG TTT GAC 3962 
Pro Leu Lys Thr He Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp 
1260 1265 1270 

TGT GTG GTG AAC TCC CTG AAG AAT GTC CTC AAC ATC TTG ATT GTC TAC 4010 
Cys Val Val Asn Ser Leu Lys Asn Val Leu Asn He Leu He Val Tyr 
1275 1280 1285 



ATG CTC TTC ATG TTC ATA TTT GCC GTC ATT GCG GTG CAG CTC TTC AAA 4058 
Met Leu Phe Met Phe He Phe Ala Val He Ala Val Gin Leu Phe Lys 
1250 1295 1300 1305 

GGG AAG TTT TTC TAC TGC ACA GAT GAA TCC AAG GAG CTG GAG AGG GAC 4106 
Gly Lys Phe Phe Tyr Cys Thr Asp Glu Ser Lys Glu Leu Glu Arg Asp 
1310 1315 1320 

TGC AGG GGT CAG TAT TTG GAT TAT GAG AAG GAG GAA GTG GAA GCT CAG 4154 
Cys Arg Gly Gin Tyr Leu Asp Tyr Glu Lys Glu Glu Val Glu Ala Gin 
1325 1330 1335 

CCC AGG CAG TGG AAG AAA TAC GAC TTT CAC TAC GAC AAT GTG CTC TGG 4202 
Pro Arg Gin Trp Lys Lys Tyr Asp Phe His Tyr Asp Asn Val Leu Trp 
1340 1345 1350 

GCT CTG CTG ACG CTG TTC ACA GTG TCC ACG GGA GAA GGC TGG CCC ATG 4250 
Ala Leu Leu Thr Leu Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Met 
1355 1360 1365 

GTG CTG AAA CAC TCC GTG GAT GCC ACC TAT GAG GAG CAG GGT CCA AGC 4298 
Val Leu Lys His Ser Val Asp Ala Thr Tyr Glu Glu Gin Gly Pro Ser 
3.370 1375 1380 1385 
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CCT G6G TAC CGC ATG GAG CTG TCC ATC TTC TAG GTG GTC TAG TTT QTG 4346 
Pro Gly Tyr Arg Met Glu Leu Ser lie Pbe Tyr Val Val Tyr Phe Val 
1390 1395 1400 

GTC TTT CCC TTC TTC TTC GTC AAC ATC TTT GTG GCT TTG ATC ATC ATC 4394 
Val Phe Pro Phe Phe Phe Val Asn lie Phe Val Ala Xieu lie He He 
1405 1410 1415 

ACC TTC CAG GAG CAG GGG GAC AAG GTG ATG TCT GAA TGC AGC CTG GAG 4442 
Thr Phe Gin Glu Gin Gly Asp Lys Val Net Ser Glu Cys Ser Leu Glu 
1420 1425 1430 

AAG AAC GAG AGG GCT TGC ATT GAC TTC GCC ATC AGC GCC AAA CCC CTG 4490 
Lys Asn Glu Arg Ala Cys He Asp Phe Ala He Ser Ala Lys Pro Leu 
1435 1440 1445 

ACA CGG TAC ATG CCC CAA AAC CGG CAG TCG TTC CAG TAT AAG ACG TGG 4538 
Thr Arg Tyr Met Pro Gin Asn Arg Gin Ser Phe Gin Tyr Lys Thr Trp 
1450 1455 1460 1465 

ACA TTT GTG GTC TCC CCG CCC TTT GAA TAC TTC ATC ATG GCC ATG ATA 4586 
Thr Phe Val Val Ser Pro Pro Phe Glu Tyr Phe He Met Ala Met He 
1470 1475 1480 

GCC CTC AAC ACT GTG GTG CTG ATG ATG AAG TTC TAT GAT GCA CCC TAT 4634 
Ala lieu Asn Thr Val Val Leu Met Met Lys Phe Tyr Asp Ala Pro Tyr 
1485 1490 1495 

GAG TAC GAG CTG ATG CTG AAA TGC CTG AAC ATC GTG TTC ACA TCC ATG 4682 
Glu Tyr Glu Leu Met Leu Lys Cys Leu Asn He Val Phe Thr Ser Met 
1500 1505 1510 

TTC TCC ATG GAA TGC GTG CTG AAG ATC ATC GCC TTT GGG GTG CTG AAC 4730 
Phe Ser Met Glu Cys Val Leu Lys He He Ala Phe Gly Val Leu Asn 
1515 1520 1525 

TAT TTC AGA GAT GCC TGG AAT GTC TTT GAC TTT GTC ACT GTG TTG QGA 4778 
Tyr Phe Arg Asp Ala Trp Asn Val Phe Asp Phe Val Thr Val Leu Gly 
1530 1535 1540 1545 

AGT ATT ACT GAT ATT TTA GTA ACA GAG ATT GCQ GAA ACG AAC AAT TTC 4826 
Ser He Thr Asp He Leu Val Thr Glu He Ala Glu Thr Asn Asn Phe 
1550 1555 1560 

ATC AAC CTC AGC TTC CTC CGC CTC TTT CGA GCT GCG CGG CTG ATC AAG 4874 
He Asn Leu Ser Phe Leu Arg Leu Phe Arg Ala Ala Arg Leu He Lys 
1565 1570 1575 

CTG CTC CGC CAG GGC TAC ACC ATC CGC ATC CTG CTG TGG ACC TTT GTC 4922 
Leu Leu Arg Gin Gly Tyr Thr He Arg He Leu Leu Trp Thr Phe Val 
1580 1585 1590 

CAG TCC TTC AAG GCC CTG CCC TAC GTG TGT CTG CTC ATT GCC ATG CTG 4970 
Gin Ser Phe Lys Ala Leu Pro Tyr Val Cys Leu Leu He Ala Met Leu 
1595 1600 1605 
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TTC TTC ATC TAG GCC ATC ATC GGC ATO GAG GTG TTT GGG AAT ATT GCC 5018 
Phe Phe He Tyr Ala He He Gly Met Gin Val Phe Gly Asn He Ala 
3.610 1615 1620 1625 

CTG GAT GAT GAC ACC AGC ATC AAC CGC CAC AAC AAC TTC CGG ACG TTT 5066 
Leu Asp Asp Asp Thr Ser He Asn Arg His Asn Asn Phe Arg Thr Phe 
1630 1635 1640 

TTG CAA GCC CTG ATG CTG CTG TTC AGG AGC GCC ACG GGG GAG GCC T6G 5114 
Leu Gin Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp 
1645 1650 1655 

CAC GAG ATC ATG CTG TCC TGC CTG AGC AAC CAG GCC TGT GAT GAG CAG 5162 
His Qlu He Met Leu Ser Cys Leu Ser Asn Gin Ala Cys Asp Glu Gin 
1660 1665 1670 

GCC AAT GCC ACC GAG TGT GGA AGT GAC TTT GCC TAC TTC TAC TTC 6TC 5210 
Ala Asn Ala Thr Glu Cys Gly Ser Asp Phe Ala Tyr Phe Tyr Phe Val 
1675 1680 1685 

TCC TTC ATC TTC CTG TGC TCC TTT CTG ATG TTG AAC CTC TTT GTG GCT 5258 
Ser Phe He Phe Leu Cys Ser Phe Leu Met Leu Asn Leu Phe Val Ala 
1690 1695 1700 1705 

GTG ATC ATG GAC AAT TTT GAG TAC CTC ACG CGG GAC TCT TCC ATC CTA 5306 
Val He Met Asp Asn Phe Glu Tyr Leu Thr Arg Asp Ser Ser He Leu 
1710 1715 1720 

GGT CCT CAC CAC TTG GAT GAG TTC ATC CGG GTC TGG GCT GAA TAC GAC 5354 
Gly Pro His His Leu Asp Glu Phe He Arg Val Trp Ala Glu Tyr Asp 
1725 1730 1735 

CCG GCT GCG TGT GGG CGC ATC AGT TAC AAT GAC ATG TTT GAG ATG CTG 5402 
Pro Ala Ala Cys Gly Arg He Ser Tyr Asn Asp Met Phe Glu Met Leu 
1740 1745 1750 

AAA CAC ATG TCC CCQ CCT CTG GGG CTG GGG AAG AAA TGC CCT GCT CGA 5450 
Lys His Met Ser Pro Pro Leu Gly Leu Gly Lys Lys Cys Pro Ala Arg 
1755 1760 1765 

GTT GCT TAC AAG CGC CTG GTT CGC ATG AAC ATG CCC ATC TCC AAC GAG 5498 
Val Ala Tyr Lys Arg Leu Val Arg Met Asn Met Pro He Ser Asn Glu 
1770 1775 1780 1785 

GAC ATG ACT GTT CAC TTC ACG TCC ACG CTG ATG GCC CTC ATC CGG ACG 5546 
Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala Leu He Arg Thr 
1790 1795 1800 

GCA CTG GAG ATC AAG CTG GCC CCA GCT GGG ACA AAG CAG CAT CAG TGT 5594 
Ala Leu Glu He Lys Leu Ala Pro Ala Gly Thr Lys Gin His Gin Cys 
1805 1810 1815 

GAC GCG GAG TTG AGG AAG GAG ATT TCC GTT GTG TGG GCC AAT CTG CCC 5642 
Asp Ala Glu Leu Arg Lys Glu He Ser Val Val Trp Ala Asn Leu Pro 
1820 1825 1830 
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CAG AAG ACT TTG GAC TTG CTG 6TA CCA CCC CAT AA6 CCT GAT GAG ATG 5690 
Gin Lys Thr Leu Asp Leu Leu Val Pro Pro His Lys Pro Asp Glu Met 
1835 1840 1845 

ACA GTG 6GG AAG GTT TAT GCA GCT CTG ATG ATA TTT GAC TTC TAG AAG 5738 
Thr Val Gly Lys Val Tyr Ala Ala Leu Met lie Phe Asp Phe Tyr Lys 
1850 1855 1860 1865 

CAG AAC AAA ACC ACC AGA GAC CAG ATG CAG CAG GCT CCT GGA GGC CTC 5786 
Gin Asn Lys Thr Thr Arg Asp Gin Met Gin Gin Ala Pro Gly Gly Leu 
1870 1875 1880 

TCC CAG ATG GGT CCT GTG TCC CTG TTC CAC CCT CTG AAG GCC ACC CTG 5834 
Ser Gin Met Gly Pro Val Ser Leu Phe His Pro Leu Lys Ala Thr Leu 
1885 1890 1895 

GAG CAG ACA CAG CCG GCT GTG CTC CGA GGA GCC CGG GTT TTC CTT CGA 5882 
Glu Gin Thr Gin Pro Ala Val Leu Arg Gly Ala Arg Val Phe Leu Arg 
1900 1905 1910 

CAG AAG AGT TCC ACC TCC CTC AGC AAT GGC GGG GCC ATA CAA AAC CAA 5930 
Gin Lys Ser Ser Thr Ser Leu Ser Asn Gly Gly Ala lie Gin Asn Gin 
1915 1920 1925 

GAG AGT GGC ATC AAA GAG TCT GTC TCC TGG GGC ACT CAA AGG ACC CAG 5978 
Glu Ser Gly lie Lys Glu Ser Val Ser Trp Gly Thr Gin Arg Thr Gin 
1930 1935 1940 1945 

GAT GCA CCC CAT GAG GCC AGG CCA CCC CTG GAG CGT GGC CAC TCC ACA 6026 
Asp Ala Pro His Glu Ala Arg Pro Pro IjCU Glu Arg Gly His Ser Thr 
1950 1955 1960 

GAG ATC CCT GTG GGG CGG TCA GGA GCA CTG GCT GTG GAC GTT CAG ATG 6074 
Glu lie Pro Val Gly Arg Ser Gly Ala Leu Ala Val Asp Val Gin Met 
1965 1970 1975 

CAG AGC ATA ACC CGG AGG GGC CCT GAT GGG GAG CCC CAG CCT GGG CTG 6122 
Gin Ser lie Thr Arg Arg Gly Pro Asp Gly Glu Pro Gin Pro Gly Leu 
1980 1985 1990 

GAG AGC CAG GGT CGA GCG GCC TCC ATG CCC CGC CTT GCG GCC GAG ACT 6170 
Glu Ser Gin Gly Arg Ala Ala Ser Met Pro Arg Leu Ala Ala Glu Thr 
1995 2000 2005 

CAG CCC GTC ACA GAT GCC AGC CCC ATG AAG CGC TCC ATC TCC ACG CTG 6218 
Gin Pro Val Thr Asp Ala Ser Pro Met Lys Arg Ser lie Ser Thr Leu 
2010 2015 2020 2025 

GCC CAG CGG CCC CGT GGG ACT CAT CTT TGC AGC ACC ACC CCG GAC CGC 6266 
Ala Gin Arg Pro Arg Gly Thr His Leu Cys Ser Thr Thr Pro Asp Arg 
2030 2035 2040 

CCA CCC CCT AGC CAG GCG TCG TCG CAC CAC CAC CAC CAC CGC TGC CAC 6314 
Pro Pro Pro Ser Gin Ala Ser Ser His His His His His Arg Cys His 
2045 2050 2055 
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CGC CGC A6G GAC A6G AA6 CA6 AG6 TCC CTG GAG AAG GGG CCC AGC CT6 6362 
Arg Arg Axg Asp Arg Lys Gin Arg Ser Leu Glu Lys Gly Pro Ser Leu 
2060 2065 2070 

TCT 6CC GAT ATG GAT GGC GCA CCA AGC AGT GCT QTG GGG CCG GGG CTG 6410 
Ser Ala Asp Net Asp Gly Ala Pro Ser Ser Ala Val Gly Pro Gly Leu 
2075 2080 2085 

CCC CCG GGA GAG GGG CCT ACA GGC TGC CGG CGG GAA CGA GAG CGC CGG 6458 
Pro Pro Gly Glu Gly Pro Thr Gly Cya Arg Arg Glu Arg Glu Arg Arg 
2090 2095 2100 2105 

CAG GAG CGG GGC CGG TCC CAG GAG CGG AGG CAG CCC TCA TCC TCC TCC 6506 
Gin Glu Arg Gly Arg Ser Gin Glu Arg Arg Gin Pro Ser Ser Ser Ser 
2110 2115 2120 

TCG GAG AAG CAG CGC TTC TAG TCC TGC GAC CGC TTT GGG GGC CGT GAG 6554 
Ser Glu Lys Gin Arg Phe Tyr Ser Cys Asp Arg Phe Gly Gly Arg Glu 
2125 2130 2135 

CCC CCG AAG CCC AAG CCC TCC CTC AGC AGC CAC CCA ACG TCG CCA ACA 6602 
Pro Pro Lys Pro Lys Pro Ser Leu Ser Ser His Pro Thr Ser Pro Thr 
2140 2145 2150 

GCT GGC CAG GAG CCG GGA CCC CAC CCA CAG GCC GGC TCA GCC GTG GGC 6650 
Ala Gly Gin Glu Pro Gly Pro His Pro Gin Ala Gly Ser Ala Val Gly 
2155 2160 2165 

TTT CCG AAC ACA ACG CCC TGC TGC AGA GAG ACC CCC TCA GCC AGC CCC 6698 
Phe Pro Asn Thr Thr Pro Cys Cys Arg Glu Thr Pro Ser Ala Ser Pro 
2170 2175 2180 2185 

TGG CCC CTG GCT CTC GAA TTG GCT CTG ACC CTT ACC TGG GGC AGC GTC 6746 
Trp Pro Leu Ala Leu Glu Leu Ala Leu Thr Leu Thr Trp Gly Ser Val 
2190 2195 2200 

TGG ACA GTG AGG CCT CTG TCC ACG CCC TGC CTG AGG ACA CGC TCA CTT 6794 
Trp Thr Val Arg Pro Leu Ser Thr Pro Cys Leu Arg Thr Arg Ser Leu 
2205 2210 2215 

TCG AGG AGG CTG TGG CCA CCA ACT CGG GCC GCT CCT CCA GGA CTT CCT 6842 
Ser Arg Arg lieu Trp Pro Pro Thr Arg Ala Ala Pro Pro Gly lieu Pro 
2220 2225 2230 

ACG TGT CCT CCC TGACCTCCCA QTCTCACCCT CTCCGC06CG TGCCCAACGG 6894 
Thr Cys Pro Pro 
2235 

TTACCACTGC ACCCTGGGAC TCAGCTCGGG TGGCCGAGCA CGGCACAGCT ACCACCACCC 6954 

TGACCAAGAC CACTGGTGCT AGCT6CACCG TGACCGCTCA GACGCCTGCA TGCAGCAGGC 7014 

- GTGTGTTCCA 6TGGATQAGT TTTATCATCC ACACGGGGCA GTGGGCCCTC GGGGGAGGCC 7074 



TTGCCCACCT TGGTGAGGCT CCTGTGGCCC CTCCCTCCCC CTCCTCCCCT CTTTTACTCT 7134 
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AGACGACOAA TAAAGCCCTG TTGCTTGAGT GTACGTACCG C 7175 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1437 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1435.. 1546 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

mI^ f,^? ^ ^ ^ CCT TAC CCA CCC TCC CAG 48 

Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gin 
^ 5 10 15 

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCQ CAG GGC AAA TAC AGC 96 
Glu lie Pro Met Glu Val Phe Asp Pro Ser Pro Gin Gly Lys Tyr Ser 
20 25 



30 



AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GOG AGC ACG TCC TCG GAT 144 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 



45 



^^C ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA QCG GAG TCC TAC ACC 192 
Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 240 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
^5 70 75 80 



TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 288 
Leu Arg Lys Glu XLa Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 336 
Lys Thr Lys Pro Val Ma Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 110 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala lie Thr Phe 
115 120 125 
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GA6 CCC AAA GAC TTC CTG CAC ATC AA6 GAG AAA TAC AAT AAT GAC TOG 432 
Glu Pro Lys Asp Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 460 
Trp He Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 
145 150 155 160 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA GAG AAG CTG 528 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gin Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GCC AAA CAG AAG CAG AAG TCG ACA GAG CAT GTG CCC CCC TAT GAC 672 
Ser Ala Lys Gin Lys Gin Lys Ser Thr Glu His Val Pro Pro Tyr Asp 
210 215 220 

GTG GTG CCT TCC ATG AGG CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG 720 
Val Val Pro Ser Met Arg Pro He He Leu Val Gly Pro Ser Leu Lys 
225 230 235 240 

GGC TAC GAG GTT ACA GAC ATG ATG CAG AAA OCT TTA TTT GAC TTC TTG 768 
Gly Tyr Glu Val Thr Asp Met Met Gin Lys Ala Leu Phe Asp Phe Leu 
245 250 255 

AAG CAT CGG TTT GAT GGC AGG ATC TCC ATC ACT CGT GTG ACG GCA GAT 816 
Lys His Arg Phe Asp Gly Arg He Ser He Thr Arg Val Thr Ala Asp 
260 265 270 

ATT TCC CTG GCT AAG CGC TCA GTT CTC AAC AAC CCC AGC AAA CAC ATC 864 
He Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His He 
275 280 285 

ATC ATT GAG CGC TCC AAC ACA CGC TCC AGC CTG GCT GAG GTG CAG AGT 912 
He He Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gin Ser 
290 295 300 

GAA ATC GAG CGA ATC TTC GAG CTG GCC CGG ACC CTT CAG TTG GTC GCT 960 
Glu He Glu Arg He Phe Glu Leu Ala Arg Thr Leu Gin Leu Val Ala 
305 310 315 320 

CTG GAT GCT GAC ACC ATC AAT CAC CCA GCC CAG CTG TCC AAG ACC TCG 1008 
Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu Ser Lys Thr Ser 
325 330 335 

CTG GCC CCC ATC ATT GTT TAC ATC AAG ATC ACC TCT CCC AAG GTA CTT 1056 
Leu Ala Pro He He Val Tyr He Lys He Thr Ser Pro Lys Val Leu 
340 345 350 
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CAA AGG CTC ATC AAQ TCC C6A GGA AA6 TCT CAG TCC AAA CAC CTC AAT 1104 
Gin Arg Leu lie Lys Ser Arg Gly Lys Ser Gin Ser Lys His Leu Asn 
355 360 365 

GTC CAA ATA 6C6 GCC TC6 GAA AAG CTG GCA CAG TGC CCC CCT GAA ATG 1152 
Val Gin He Ala Ala Ser Glu Lys Leu Ala Gin Cys Pro Pro Glu Met 
370 375 380 

TTT GAC ATC ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT 1200 
Phe Asp lie He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His 
385 390 395 400 

CTG GCG GAG TAG TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGC 1248 
Leu Ala Glu Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser 
405 410 415 

AGC ACG CCA CCC AAT CCG CTG CTG AAC CGC ACC ATG GCT ACC GCA GCC 1296 
Ser Thr Pro Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala 
420 425 430 

CTG GCT GCC AGC CCT GCC CCT GTC TCC AAC CTC CAG GTA CAG GTG CTC 1344 
Leu Ala Ala Ser Pro Ala Pro Val Ser Asn Leu Gin Val Gin Val Leu 
435 440 445 

ACC TCG CTC AGG AGA AAC CTC GGC TTC TGG GGC GGG CTG GAG TCC TCA 1392 
Thr Ser Leu Arg Arg Asn Leu Gly Phe Trp Gly Gly Leu Glu Ser Ser 
450 455 460 

CAG CGG GGC AGT GTG GTG CCC CAG GAG CAG GAA CAT GCC ATG TAGTGGGCGC 1444 
Gin Arg Gly Ser Val Val Pro Gin Glu Gin Glu His Ala Met 
465 470 475 

CCTGCCCGTC TTCCCTCCTG CTCTGGGGTC GGAACTGGAG TGCAGGGAAC ATGGAGGAGG 1504 
AAGGGAAGAG CTTTATTTTG TAAAAAAATA AGATGA6CGG CA 1546 



(2) INFORMATION FOR SKQ ID MO:10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1797 

(D) OTHER INFORMATION: /standard^name* "Betal-3" 



(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1795. .1851 
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ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC GAG 48 
Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gin 
1 5 10 15 

GAG ATC CCC ATG GGA GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 96 
Glu lie Pro Met Gly Val Phe Asp Pro Ser Pro Gin Gly Lys Tyr Ser 
20 25 30 

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG QAT 144 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 192 
Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 240 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 80 

TTA AGG MiG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 288 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 336 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 110 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala lie Thr Phe 
115 120 125 

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432 
Glu Pro Lys Asp Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480 
Trp lie Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 
145 150 155 160 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528 
Ser Pro Val Lys Leu Asp Ser X.eu Arg Leu Leu Gin Glu Gin Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gin Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GCC AAA CAG AAG CAG AAG TCG ACA GAG CAT GTG CCC CCC TAT GAC 672 
Ser Ala Lys Gin Lys Gin Lys Ser Thr Glu His Val Pro Pro Tyr Asp 
210 215 220 
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GTG GTG CCT TCC ATG AGG CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG 720 
Val Val Pro Ser Met Arg Pro He He Leu Val Gly Pro Ser l-eu Lys 
225 230 235 240 

GGC TAC GAG GTT ACA GAC ATG ATG CAG AAA GCT TTA TTT GAC TTC TTG 768 
Gly Tyr- Glu Val Thr Asp Net Met Gin Lys Ala Leu Phe Asp Phe Leu 
245 250 255 

AAG CAT CGG TTT GAT GGC AGG ATC TCC ATC ACT CGT GTG ACG GCA GAT B16 
Lys His Arg Phe Asp Gly Arg He Ser He Thr Arg Val Thr Ala Asp 
2€0 265 270 

ATT TCC CTG GCT AAG C6C TCA GTT CTC AAC AAC CCC AGC AAA CAC ATC 864 
He Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His He 
275 280 285 

ATC ATT GAG CGC TCC AAC ACA CGC TCC AGC CTG GCT GAG GTG CAG AGT 912 
He He Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gin Ser 
290 295 300 

GAA ATC GAG CGA ATC TTC GAG CTG GCC CGG ACC CTT CAG TTG GTC GCT 960 
Glu He Glu Arg He Phe Glu Leu Ala Arg Thr I<eu Gin Leu Val Ala 
305 310 315 320 

CTG GAT GCT GAC ACC ATC AAT CAC CCA GCC CAG CTG TCC MiG ACC TCG 1008 
Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu Ser Lys Thr Ser 
325 330 335 

CTG GCC CCC ATC ATT GTT TAC ATC AAG ATC ACC TCT CCC AAG GTA CTT 1056 
Leu Ala Pro He He Val Tyr He Lys He Thr Ser Pro Lys Val Leu 
340 345 350 

CAA AGG CTC ATC AAG TCC CGA GGA AAG TCT CAG TCC AAA CAC CTC AAT 1104 
Gin Arg Leu He Lys Ser Arg Gly Lys Ser Gin Ser Lys His Leu Asn 
355 360 365 

GTC CAA ATA GCG GCC TCG GAA AAG CTG GCA CAG TGC CCC CCT GAA ATG 1152 
Val Gin He Ala Ala Ser Glu Lys Leu Ala Gin Cys Pro Pro Glu Met 
370 375 380 

TTT GAC ATC ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT 1200 
Phe Asp He He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His 
385 390 395 400 

CTG GCG GAG TAC TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGC 1248 
Leu Ala Glu Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser 
405 410 415 

AGC ACG CCA CCC AAT CCG CTG CTG AAC CGC ACC ATG GCT ACC GCA GCC 1296 
Ser Thr Pro Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala 
420 425 430 

CTG GCT GCC AGC CCT GCC CCT GTC TCC AAC CTC CAG GGA CCC TAC CTT 1344 
Leu Ala Ala Ser Pro Ala Pro Val Ser Asn Leu Gin Gly Pro Tyr Leu 
435 440 445 
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GCT TCC GGG GAC CAG CCA CTG GAA CGG GCC ACC GGG GAG CAC GCC AGC 1392 
Ala Ser Gly Asp Gin Pro Leu Glu Arg Ala Thr Gly Glu His Ala Ser 
450 455 460 

ATG CAC GAG TAC CCA GGG GAG CTG GGC CAG CCC CCA GGC CTT TAG CCC 1440 
Met His Glu Tyr Pro Gly Glu Leu Gly Gin Pro Pro Gly Leu Tyr Pro 
4€5 470 475 460 

AGC AGC CAC CCA CCA GGC CGG GCA GGC ACG CTA CGG GCA CTG TCC CGC 1488 
Ser Ser His Pro Pro Gly Arg Ala Gly Thr Leu Arg Ala Leu Ser Arg 
485 490 495 

CAA GAC ACT TTT GAT GCC GAC ACC CCC GGC AGC CGA AAC TCT GCC TAC 1536 
Gin Asp Thr Phe Asp Ala Asp Thr Pro Gly Ser Arg Asn Ser Ala Tyr 
500 505 510 

ACG GAG CTG GGA GAC TCA TGT GTG GAC ATG GAG ACT GAC CCC TCA GAG 1584 
Thr Glu Leu Gly Asp Ser Cys Val Asp Met Glu Thr Asp Pro Ser Glu 
515 520 525 

GGG CCA GGG CTT GGA GAC CCT GCA GGG GGC GGC ACG CCC CCA GCC CGA 1632 
Gly Pro Gly Leu Gly Asp Pro Ala Gly Gly Gly Thr Pro Pro Ala Arg 
530 535 540 

CAG GGA TCC TGG GAG GAC GAG GAA GAA GAC TAT GAG GAA GAG CTG ACC 1680 
Gin Gly Ser Tip Glu Asp Glu Glu Glu Asp Tyr Glu Glu Glu Leu Thr 
545 550 555 560 

GAC AAC CGG AAC CGG GGC CGG AAT AAG GCC CGC TAC TGC GCT GM GGT 1728 
Asp Asn Arg Asn Arg Gly Arg Asn Lys Ala Arg Tyr Cys Ala Glu Gly 
565 570 575 

GGG GGT CCA GTT TTG GGG CGC AAC AAG AAT GAG CTG GAG GGC TGG GGA 1776 
Gly Gly Pro Val Leu Gly Arg Asn Lys Asn Glu Leu Glu Gly Trp Gly 
580 585 590 

CGA GGC GTC TAC ATT CGC TGAGAGGCAG GGGCCACACG GCGGGA6GAA 1824 
Arg Gly Val Tyr lie Arg 
595 

GGGCTCTGAG CCCAGGGGAG GGGAGGG 1851 
(2) INFORMATION FOR SEQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) NOLECDLE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 35.. 3310 

(T>) OTHER INFORMATION: /stianda-rH r»amA= 
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(ix) FEATORE: 

(A) KAME/KEY: 5'UTR 

(B) LOCATION: 1..34 

(ix) FEATX}RE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3308.. 3600 

(xi) SEQX7ENCE DESCRIPTION: SEQ ID NO: 11: 

GCGGGG6AG6 G06CATTGAT CTTCGATC6C GAAO ATG GOT GOT G6C TGC CTG 52 

Met Ala Ala Gly Cys Leu 
1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100 
Leu Ala Leu Thr Leu Thr Leu Phe Gin Ser Leu Leu lie Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 148 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr lie Lys Ser Trp Val Asp 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 196 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 244 
Asn Gin Leu Val Asp lie Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 292 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu lie Ala Ala Arg Asp lie 
75 80 85 

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 340 
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 
90 95 100 

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388 
Glu Ala Glu Lys Val Gin Ala Ala His Gin Trp Arg Glu Asp Phe Ala 
105 110 115 

AGC AAT GAA GTT GTC TAG TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 436 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 484 
Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg lie Lys Pro Val Phe lie 
135 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532 
Glu Asp Ala Asn Phe Gly Arg Gin lie Ser Tyr Gin His Ala Ala Val 
155 160 165 

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580 
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170 175 180 

CTC AAC TGG ACA ACT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 628 
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 
185 190 195 

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676 
Glu Asp Pro Ser Leu Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu 
200 205 210 

GOT CGA TAT TAT CCA GOT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772 
Asn Lys lie Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 
235 240 245 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 820 
Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Gly 
250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 916 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 964 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012 
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn He Thr 
315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060 
Ala Lys Gly He Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108 
Gin heu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 
360 365 370 

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 1204 
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 
375 380 385 390 



CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 
nin W-iK licjTi rryrr- dlyi TSttt nlv Vm Tlf* fSlTI Trr» Mial- Al^» Pvs Gill Arti 



1252 
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395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300 
Lys Gly Tyr Tyr Tyr Glu lie Pro Ser He Gly Ala He Arg He Asn 
410 415 420 

ACT CM GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 134B 
Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly 
425 430 435 

GAC AAA GCT AAG CAA 6TC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 1396 
Asp Lys Ala Lys Gin Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu 
440 445 450 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 1444 
Glu Leu Gly Leu Val He Thr Gly Thr Leu Pro Val Phe Asn He Thr 
455 460 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 1492 
Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly 
475 480 4B5 

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 1540 
Val Met Gly Val Asp Val Ser Leu Glu Asp He Lys Arg Leu Thr Pro 
490 495 500 

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala He Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG AAC CCC AAA TCT 1636 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys hsn Pro Lys Ser 
520 525 530 

CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT 1684 
Gin Glu Pro Val Thr Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp 
535 540 545 550 

ATT AAA GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA 1732 
He Lys Val Glu He Arg Asn Lys Met He Asp Gly Glu Ser Gly Glu 
555 560 565 

AAA ACA TTC AGA ACT CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC 1780 
Lys Thr Phe Arg Thr Leu Val Lys Ser Gin Asp Glu Arg Tyr He Asp 
570 575 580 

AAA GGA AAC AGG ACA TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC 1828 
Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr 
585 590 595 

AGT TTG GCC TTG GTA TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC 1876 
Ser Leu Ala Leu Val Leu Pro Thr Tyr Ser Phe Tyr Tyr He Lys Ala 
600 605 610 

AAA CTA GAA GAG ACA ATA ACT CAG GCC AGA TCA AAA AAG GGC AAA ATG 1924 

r^p^ T f^i.. mv.^ Tim rnVi^ r*!^ -M fi r«»<w. T.*^ T <...*^ r*t^-r T-m Mr%*- 
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615 620 



625 630 



1972 



AAG GAT TCG GAA ACC CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT 
Lys Asp Ser Glu Thr Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tvr 
635 640 645 

ACA TTC ATA GCA CCA AGA GAT TAG TGC AAT GAG CTG AAA ATA TCG GAT 2020 
Thr Phe lie Ala Pro Arg Asp Tyr Cys Asn Asp Leu Lys lie Ser Asp 
650 655 660 

AAT AAC ACT GAA TTT CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA 2068 
Asn Asn Thr Glu Phe Leu Leu Asn Phe Asn Glu Phe He Asp Arg Lys 
665 670 675 

ACT CCA AAC AAC CCA TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG 2116 
Thr Pro Asn Asn Pro Ser Cys Asn Ala Asp Leu He Asn Arg Val Leu 
680 685 690 

CTT GAT Gm OTC m ACA AAT GAA CT^ 2164 
Leu Asp Ala Gly Phe Thr Asn Glu Leu Val Gin Asn Tyr Trp Ser Lys 
695 700, 705 710 

CAG AAA AAT ATC AAG GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT 2212 
Gin Lys Asn He Lys Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly 
715 720 725 

GGG ATT ACC AGA GTT TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA 2260 
Gly He Thr Arg Val Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gin Glu 
730 735 740 

AAC CCA GAG ACA TAT GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT AAT 2308 
Asn Pro Glu Thr Tyr Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn 
745 750 755 

GAT AAC TAT GTT TTC ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT 2356 
Asp Asn Tyr Val Phe Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly 
760 765 770 

GCC TAT GAA TCG GGC ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT 2404 
Ala Tyr Glu Ser Gly He Met Val Ser Lys Ala Val Glu He Tyr He 
775 780 785 790 

CAA GGG AAA CTT CTT AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA 2452 
Gin Gly Lys Leu Leu Lys Pro Ala Val Val Gly He Lys He Asp Val 
795 800 805 

AAT TCC TGG ATA GAG AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT 2500 
Asn Ser Trp He Glu Asn Phe Thr Lys Thr Ser He Arg Asp Pro Cys 
810 815 820 

GCT GOT CCA GTT TOT GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT 2548 
Ala Gly Pro Val Cys Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys 
825 830 835 

GTG ATT CTG GAT GAT GCT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT 2596 
Val He Leu Asp Asp Gly Gly Phe Leu Leu Met Ala Asn His Asp Asd 
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840 



845 



850 



TAT ACT AAT CAG ATT GGA AGA TTT TTT GGA GAG ATT GAT CCC AGC TTG 2644 
Tyr Thr Asn Gin He Gly Arg Phe Phe Gly Glu lie Asp Pro Ser Leu 
855 860 865 870 

ATG AGA CAC CTG GTT AAT ATA TCA GTT TAT GCT TTT AAC AAA TCT TAT 2692 
Met Arg His Leu Val Asn He Ser Val Tyr Ala Phe Asn Lys Ser Tyr 
875 880 865 

GAT TAT CAG TCA GTA TGT GAG CCC GGT GCT GCA CCA AAA CAA GGA GCA 2740 
Asp Tyr Gin Ser Val Cys Glu Pro Gly Ala Ala Pro Lys Gin Gly Ala 
690 895 900 

GGA CAT CGC TCA GCA TAT GTG CCA TCA GTA GCA GAC ATA TTA CAA ATT 2788 
Gly His Arg Ser Ala Tyr Val Pro Ser Val Ala Asp He Leu Gin He 
905 910 915 

GGC TGG TGG GCC ACT GCT GCT GCC TGG TCT ATT CTA CAG CAG TTT CTC 2836 
Gly Trp Trp Ala Thr Ala Ala Ala Trp Ser He Leu Gin Gin Phe Leu 
920 925 930 

TTG AGT TTG ACC TTT CCA CGA CTC CTT GAG GCA GTT GAG ATG GAG GAT 2884 
Leu Ser Leu Thr Phe Pro Arg Leu Leu Glu Ala Val Glu Met Glu Asp 
935 940 945 950 

GAT GAC TTC ACQ GCC TCC CTG TCC AAG CAG AGC TGC ATT ACT GAA CAA 2932 
Asp Asp Phe Thr Ala Ser Leu Ser Lys Gin Ser Cys He Thr Glu Gin 
955 960 965 

ACC CAG TAT TTC TTC GAT AAC GAC AGT AAA TCA TTC AGT GGT GTA TTA 2980 
Thr Gin Tyr Phe Phe Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu 
970 975 980 

GAC TGT GGA AAC TGT TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC 3028 
Asp Cys Gly Asn Cys Ser Arg He Phe His Gly Glu Lys Leu Met Asn 
985 990 995 

ACC AAC TTA ATA TTC ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT 3076 
Thr Asn Leu He Phe He Met Val Glu Ser Lys Gly Thr Cys Pro Cys 
1000 1005 1010 



GAC ACA CGA CTG CTC ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT 
Asp Thr Arg Leu Leu He Gin Ala Glu Gin Thr Ser Asp Gly Pro Asn 
1015 1020 1025 1030 



3124 



CCT TGT GAC ATG GTT AAG CAA CCT AGA TAC CGA AAA GGG CCT GAT GTC 
Pro Cys Asp Met Val Lys Gin Pro Arg Tyr Arg Lys Gly Pro Asp Val 
1035 1040 1045 



3172 



TGC TTT GAT AAC AAT GTC TTG GAG GAT TAT ACT GAC TGT GGT GGT GTT 
Cys Phe Asp Asn Asn Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val 
1050 1055 1060 



3220 



TCT GGA TTA AAT CCC TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA 
Ser Glv Leu Asn Pro Ser Leu Trp Tvr He He Glv He Gin Phe Leu 



3268 
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1065 1070 1075 

CTA CTT TGG CTG GTA TCT GGC AGC ACA CAC CGG CTG TTA TGACCTTCTA 
Leu Leu Trp Leu Val Ser Gly Ser Thr His Arg Leu Leu 
1080 1085 1090 

AAAACCAAAT CTGCATAGTT AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT 

TACAGTAAC6 TAGGGTCAGC TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC 

ATAACACTAA GGCX3CAGACT CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC 

TTAAACGTGT QTGAATGCTG CATCATCTAT GTOTAACATC AAASCAAAAT CCTATACGTG 

TCCTCTATTG GAAAATTTGG GC G TTTGTT G TTGCATTGTT GGT 

(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CH ARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCCCCTGCCA GTGGCCAAAC AGAAGCAGAA GTCGGGTAAT GAAATGACTA ACTTAGCCTT 
TGAACTAGAC CCCCTAGAGT TAGAGGAGGA AGAGGCTGAG CTTGGTGAGC AGAGTGGCTC 
TGCCAAGACT AGTGTTAGCA GTGTCACCAC CCCGCCACCC CATGGCAAAC GCATCCCCTT 
CTTT3\AGAA6 ACAGA6CATG TGCCCCCCTA TGACGTGGTG CCTTCCATGA GGCCCATCAT 
CCTGGTGGGA CC6TCGCTCA AGGGCTACGA GGTTACAGAC ATGATGCAGA AAGCTTTATT 
TGACTTCTTG AAGCATCGGT TTG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCTATTGGTG TAG6TATACC AACAATTAAT TTAAGAAAAA GGAGACCCAA TATCCAG 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IBO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 1..132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TGG TCC TTT GCC TGC GCC TGT GCC GCC TTC ATC CTC CTC TTT CTC GGC 
Tzp ser Phe Ala Cys Ala Cys Ala Ala Phe He Leu Leu Phe Leu gS 
^ 5 10 . 15 

GGT CTC GCC CTC CTG CTG TTC TCC CTG CCT CGA ATG CCC CGG AAC CCA 
Gly Leu Ala Leu Leu Leu Phe Ser Leu Pro Arg Met Pro Arg Asn Pro 
20 25 30 

^ e"^^ ^« CAC TAACCCTCCT GCGGCCCTAG 

Tip Glu Ser Cys Met Asp Ala Glu Pro Glu His 
35 40 

CGACCCTCAG GCTTCTTCCC AGGAAGC6G6 6 

(2) INFORMATIOT FOR SEQ ID N0:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15; 

AATTCGGTAC GTACACTCGA GC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



48 



96 



149 



180 



22 
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GCTCGAGTGT ACGTACCG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(Xi) SEQUENCE DESCRIPTIC»f : SEQ ID NO: 17: 

CCATGGTACC TTCGTTGACG 

(2) INFORMATION FOR SEQ ID NO: IB: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AATTCGTCAA C6AAGGTACC ATGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCAT ION; 53.. 1504 

(D) OTHER INFORMATION: /standard_namea "Beta-3-1" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CCGCCTCGGA CCCCCTGTCC CGGGGGAGGG GGAGT^CCCG CTACCCTGGT CT ATG 

Met 
1 

TCT TTT TCT GAC TCC AGT GCA ACC TTC CTG CTG AAC GAG GGT TCA GCC 
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Ser Phe Ser Asp Ser Ser Ala Thr Phe I^eu Leu Asn 61u Gly Ser Ala 
5 10 15 

GAC TCC TAG ACC AGC CGC CCA TCT CTG GAC TCA GAC GTC TCC CTG GAG 151 
Asp Ser Tyr Thr Ser Arg Pro Ser Leu Asp Ser Asp Val Ser Leu Glu 
20 25 30 

GAG GAC COG GAG AGT GCC CGG CGT GAA GTA GAG AGC CAG GCT CAG CAG 199 
Glu Asp Arg Glu Ser Ala Arg Arg Glu Val Glu Ser Gin Ala Gin Qln 
35 40 45 

CAG CTC GAA AGG GCC AAG CAC AAA CCT GTG OCA TTT GCG GTG AGG ACC 247 
Gin Leu Glu Arg Ala Lys His Lys Pro Val Ala Phe Ala Val Arg Thr 
50 55 60 65 

AAT GTC AGC TAC TOT GGC GTA CTG GAT GAG GAG TGC CCA GTC CAG GGC 295 
Asn Val Ser Tyr Cys Gly Val Leu Asp Glu Glu Cys Pro Val Gin Gly 
70 75 80 

TCT GGA GTC AAC TTT GAG GCC AAA GAT TTT CTG CAC ATT AAA GAG AAG 343 
Ser Gly Val Asn Phe Glu Ala Lys Asp Phe Leu His lie Lys Glu Lys 
85 90 95 

TAC AGC AAT GAC TGG TGG ATC GGG CGG CTA GTG AAA GAG GGC GGG GAC 391 
Tyr Ser Asn Asp Trp Trp He Gly Arg Leu Val Lys Glu Gly Gly Asp 
100 105 110 

ATC GCC TTC ATC CCC AGC CCC CAG CGC CTG GAG AGC ATC CGG CTC AAA 439 
He Ala Phe He Pro Ser Pro Gin Arg Leu Glu Ser He Arg Leu Lys 
115 120 125 

CAG GAG CAG AAG GCC AGG AGA TCT GGG AAC CCT TCC AGC CTG AGT GAC 487 
Gin Glu Gin Lys Ala Arg Arg Ser Gly Asn Pro Ser Ser Leu Ser Asp 
130 135 140 145 

ATT GGC AAC CGA CGC TCC CCT CCG CCA TCT CTA GCC AAG CAG AAG CAA 535 
He Gly Asn Arg Arg Ser Pro Pro Pro Ser Leu Ala Lys Gin Lys Gin 
150 155 160 

AAG CAG GCG GAA CAT GTT CCC CCG TAT GAC GTG GTG CCC TCC ATG CGG 583 
Lys Gin Ala Glu His Val Pro Pro Tyr Asp Val Val Pro Ser Met Arg 
165 170 175 

CCT GTG GTG CTG GTG GGA CCC TCT CTG AAA G6T TAT GAG GTC ACA GAC 631 
Pro Val Val Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp 
180 185 190 

ATG ATG CAG AAG GCT CTC TTC GAC TTC CTC AAA CAC AGA TTT GAT GGC 679 
Met Met Gin Lys Ala Leu Phe Asp Phe Leu Lys His Arg Phe Asp Gly 
195 200 205 

AGG ATC TCC ATC ACC CGA GTC ACA GCC GAC CTC TCC CTG GCA AAG CGA 727 
Arg He Ser He Thr Arg Val Thr Ala Asp Leu Ser Leu Ala Lys Arg 
210 215 220 225 

TCT nTO rTT" aar anT rnn anr aar; arw? nrr t^tt* rt^p n-hr* nnn n*nr> n»rwr t-tc 
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Ser Val Leu Asn Asn Pro Gly Lys Arg Thr He He Glu Arg Ser Ser 
230 235 240 

GCC CGC TCC AGC ATT GCG GAA GTG CAG AGT GAG ATC GAG CGC ATA TTT 
Ala Arg Ser Ser lie Ala Glu Val Gin Ser Glu He Glu Arg 11* Se 
245 250 255 

GAG CTG GCC AAA TCC CTG CAG CTA GTA GTG TTG GAC GCT GAC ACC ATC 
Glu Leu Ala Lys Ser Leu Gin Leu Val Val Leu Asp Ala ^ Tte lie 
260 265 270 

5f ^ CCA GCA CAG CTG GCC AAG ACC TCG CTG GCC CCC ATC ATC GTC 
His Pro Ala Gin Leu Ala Lys Thr Ser Leu Ala Pro He He Val 

280 285 

m GTC AAA GTG TCC TCA CCA AAG GTA CTC CAO CGT CTC ATT CGC TCC 
Phe Val Lys Val Ser Ser Pro Lys Val Leu Gin Arg Leu lie S sJr 

295 300 

CGG GGG AAG TCA CAG ATC AAG CAC CTQ ACC GTA CAG ATG ATG GCA TAT 
Arg Gly Lys Ser Gin Met Lys His Leu Thr Val Qln Met Met Ala ^ 
310 315 

GAT AAG CTG OTT CAG TQC CCA CCG QAQ TCA TTT GAT GTG ATT CTG GAT 
Asp Lys Leu Val Gin Cys Pro Pro Glu Ser Phe Asp Vai YS S2 aS 
325 330 335 

GAG AAC CAG CTG GAG GAT GCC TGT GAG CAC CTG GCT GAG TAC CTG GAG 
Glu Asn Gin Leu Glu Asp Ala Cys Glu His Leu Ala Glu Tyr Leu Glu 
340 345 



350 



GTT TAC TGG CGG GCC AOG CAC CAC CCA GCC CCT GGC CCC GGA CTT CTG 
Val Tyr Trp Arg Ala Thr His His Pro Ala Pro Gly Pro Gly Leu Leu 

355 350 



365 



GCT CCT CCC AGT GCC ATC CCC GGA CTT CAG AAC CAG CAG CTG CTG GGG 
Gly Pro Pro Ser Ala lie Pro Gly Leu Gin Asn Gin Gin Leu Leu Gly 
370 375 ^ 



385 



GAG CGT GGC GAG GAG CAC TCC CCC CTT GAG CGG GAC AGC TTG ATG CCC 
Glu Arg Gly Glu Glu His Ser Pro Leu Glu Arg Asp Ser Leu Met Pro 
3^0 395 400 

TCT GAT GAG GCC AGC GAG AGC TCC CGC CAA GCC TGG ACA GGA TCT TCA 
Ser Asp Glu Ala Ser Glu Ser Ser Arg Gin Ala Trp Thr Gly Ser Ser 
405 410 415 

CAG CGT AGC TCC CGC CAC CTG GAG GAG GAC TAT GCA GAT GCC TAC CAG 
Gin Arg Ser Ser Arg His Leu Glu Glu Asp Tyr Ala Asp Ala Tyr Gin 
420 425 430 

GAC CTG TAC CAG CCT CAC CGC CAA CAC ACC TCG GGG CTG CCT AGT GCT 
Asp Leu Tyr Gin Pro His Arg Gin His Thr Ser Gly Leu Pro Ser Ala 
435 440 445 



823 



871 



919 



967 



1015 



1063 



1111 



1159 



1207 



1255 



1303 



1351 



1399 



AAC GGG CAT GAC CCC CAA GAC CGG CTT CTA Grc ran n»r« tv-tv r^7^?v r^i^n 
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Asn Gly His Asp Pro Gin Asp Arg Leu Leu Ala Gin Asp Ser Glu His 
450 455 460 465 

AAC CAC AGT GAC CGG AAC TGG CAG CGC AAC CGG CCT TGG CCC AAG GAT 1495 
Asn His Ser Asp Arg Asn Trp Gin Arg Asn Arg Pro Trp Pro Lys Asp 
470 475 480 

AGC TAC TGA CAG C CTCCTGCTGC CCTACCCTGG CAGGCACAGG 1543 
Ser Tyr * 

CGCAGCTGGC TG6GGGGCCC ACTCCAGGCA GGQTGGC6TT AGACTGGCAT 1593 

CAGGCTGGCA CTAGGCTCAG CCCCCAAAAC CCCCTGCCCA GCCCCAGCTT CAGGGCTOCC 1648 

TGTGGTCCCA AGGTTCTGGG AGAAACAGGG GACCCCCTCA CCTCCTGGGC AGTGACCXICT 1708 

ACTAGGCTCC CATTCCAGGT ACTAGCTGTG TGTTCTGCAC CCCTGGCACC TTCfCTCTCCT 1768 

CCCACACAGG AAGCTGCCCC ACTGGGCAGT GCCCTCAGGC CAGGATCCCC TTAGCSUS^T 1828 

CCTTCCCACC AGACTCAGGG AAGGGATGCC CCATTAAAGT GACAAAAGGG TQGGTCTGGG 1888 

CACCATGGCA TGAGGATUSAA ACAAGGTCCC TGAGCAGGCA CAAGTCCTGA CAGTCAAGGG 1948 

ACTGCTTT6G CATCCAGGGC CTCCAGTCAC CTCACTGCCA TACATTAGAA ATGAGACAAT 2008 

TCAAAGCCCC CCCAGGGTGG CACACCCATC TGTTGCTGGG GT6TCGCAGC CACATCCakAG 2068 

ACTGGAGCAG CAGGCTGGCC ACGCTTGGGC CAGAGAGAGC TCACAGCTGA AGCTCTTGGA 2128 

GGGAAGGGCT CTCCTCACCC AATCG 2153 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DKA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCAT ION: 51. .1492 

(D) OTHER INFORMATION: /product = "A Beta3 subunit of human 
calcium channel" 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CGCCCCCGOC GCC6CTCGTT CCCCCGACCC GGACTCCCCC ATGTATGACG ACTCCTACGT 60 
GCCCGGGTTT GAGGACTCGG AGGCGQTTTC AGCCGACTCC TACACCAGCC GCCCATCTCT 120 
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GGACTCAGAC GTCTCCCTGG AGG5AGGACCG GGAC3AGTGCC CGGCGTGAAG TAGAGAGCCA 180 

OGCTCAGCAG CAGCTCGAAA GGGCCAAGCA CAAACCTGTG 6CATTTGCGG TOAGGACCAA 240 

TQTCAGCTAC TGTGGCGTAC TGGATGAGGA GT6CCCAGTC CAGGGCTCTG GAGTCAACTT 300 

TGAGGCCRAA GATTTTCTGC ACATTAAAGA GAAGTACAGC AATGACTGGT GQATCGGGCG 360 

GCTAGTGAAA GAGGGCGGGG ACATCGCCTT CATCCCCA6C CCCCftGCGCC TGQAGAGCAT 420 

CCGGCTCAAA CAGGA6CAGA AG6CCAGGA6 ATCTGGGAAC CCTTCCAGCC TGA6TCAC3VT 480 

TGGCAACCOA CGCTCCCCTC CGCCATCTCT AGCCAAGCSG AAQCAAAAGC AGGCQGAACA 540 

TGTTCCCCCG TATOACGTGG TCCCCTCCAT GCGQCCTGTG GTGCTCGTGG QftCCCTCTCT 600 

GAAAGGTTAT GAOGTCACAG ACATGATGCA GAAGGCTCTC TTCGACTTCC TOIAACACAG 660 

ATTTGATGGC AGGATCTCCA TCACCOOaGT CACftGCCGAC CTCTCCCTCG CaAAGCGATC 720 

TGTGCTCAAC AATCCGGGCA AGAGQACCAT CATTGaOCGC TCCTCTGCCC GCTCCA6CAT 780 

TGCGGAAGTG C3M3AGTGAGA TCGASCGCAT ATTTGAGCTQ GCCaAATCCC TGCAGCTRGT B40 

AGTGTTQGAC GCTGACACCA TCOACCACCC AGCACAQCTO GCCAAGACCT COCTQGCCCC 900 

CATCATCGTC TTTQTCAAAG TGTCCTCACC AAAGGTACTC CAOCQTCTCA TTCGCTCCCG 960 

CK3CM3AAQTCA CAGATOAAGC ACCTGACCGT ACAGATGATO GCATATCMA AGCTGGTTCA 1020 

GTGCCCACCG GACSTCATTTG ATGTGATTCT GGATGAGAAC CAGCTGGAGG ATGCCTGTGA 1080 

GCACCTGGCT GAGTACXTTOG AGGTTTACTG GOQGGCCACG CACCACCCAG CCCCTGGCCC 1140 

CQQACTTCTG GGTCCTCCCA GTGCCaTCCC CGGACTTCAG AACCAGCAGC TGCTCGQG6A 1200 

GCGTGGCGAG GAfiCACTCCC CCCTTGAGCG GGACAGCTTG ATGCCCTCTG ATGAGGCCAG 1260 

CCWGftGCTCC CGCXaAGCCT GGACAGCaTC TTCHCAGCGJ A6CTCCCGCC ACCTGGAGGA 1320 

GCaCTATGCA GATGCCTACC AGGACCTQTA CCAQCCTCAC COCCftACACA CCTCGGGGCT 1380 

OCCTAOTQCT AACOGGCATG ACCCCCAaGA CCGGCTTCTA GCCC3W3GACT CAGAACAO^ 1440 

CKACAGTQAC CGCSAACTGGC AGCGCAACCQ GCCTTGQCCC AAGGATAQCT ACTCAC3M3CC 1500 

TCCT6CTGCC CTACCCTGQC AGGCACAGGC GCAGCT6GCT GGGGGGCCCA CTCCAGGCAG 1560 

GCrrGGCGTTA GACTGOCATC AG6CTGGCAC TAGGCTCAGC CCCCAAAACC CCCTGCCCAG 1620 

CCCCAGCTTC AGGGCTGCCT GTGGTCCCAA GGTTCTGGGA GAAACAGGGG ACCCCCTCAC 1680 

CTCCTGGGCA GTGACCCCTA CTAGGCTCCC ATTCCAGGTA CTAGCTGTGT GTTCTGCACC 1740 

CCTGGCACCT TCCTCTCCTC CCACACAQ6A AGCTGCCCCa CTGGGCAGT6 CCCTCAGGCC 1800 
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AGGATCCCCT TAGCAGGGTC CTTCCCACCA GACTCAGGGA AGGGATGCCC CATTAAAGTG 1860 

ACAAAAGGGT GGGTGTGGGC ACCATGGCAT GAGGAAGAAA CAAGGTCCCT GAGCAGGCAC 1920 

AAGTCCTGAC AGTCAAGGGA CTGCTTTGGC ATCCAGGGCC TCCAGTCACC TCACTGCCAT 1980 

ACATTAGAAA TGAGACAATT CAAAGCCCCC CCAGGGTGGC ACACCCATCT GTTGCTCGGG 2040 

TGTGGCAGCC ACATCCAAGA CTGGAGCAGC AQGCTOGCCA CGCTTGGGCC AGAGAGAGCT 2100 

CACAGCTGAA GCTCTTGGAQ GGAAGGGCTC TCCTCACCCA ATCG 2144 
(2) INFORHATION FOR SEQ ID N0:21: 

(i) SEQDENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Oligonucleotide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CTCT^ACCA TCTCTGATAC CAGCCCCA 28 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: mA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 237,. 7769 

(D) OTHER INFORMATION: /standard_name« "Alpha-lA-l" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GATGTCCCGA GCTGCTATCC CCGGCTCGGC CCGGGCAGCC GCCTTCTGAG CCCCCGACCC 60 

GAGGCGCCGA GCCGCCGCCG CCCGATGGGC TGGGCCGTGG AGCGTCTCCG CAGTCGTAGC 120 

TCCAGCCGCC GCGCTCCCAG CCCCGGCAGC CTCAGCATCA GCGGCGGCGG CGGCGGCGGC 180 

GGCGTCTTCC GCATCGTTCG CCGCAGCGTA ACCCGGAGCC CTTTGCTCTT TGCAGA 236 

ATG GCC CGC TTC GGA GAC GAG ATG CCG GCC CGC TAC GGG GGA GGA GGC 5 84 
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Met Ala Arg Phe Gly Asp Glu Met Pro Ala Arg Tyr Gly Gly Gly Gly 
^5 10 15 

TCC GGG GCA GCC GCC GGG GTG GTC GTG GGC AGC GGA GGC GGG CGA GGA 332 
Ser Gly Ala Ala Ala Gly Val Val Val Gly Ser Gly Gly Gly Arg Gly 
20 25 30 

GCC GGG GGC AGC CGG CAG GGC GGG CAG CCC GGG GCG CAA AGG ATG TAC 380 
Ala Gly Gly Ser Arg Gin Gly Gly Gin Pro Gly Ala Gin Arg Met Tyr 
35 40 45 

AAG CAG TCA ATG GCG CAG AGA GCG CGG ACC ATG GCA CTC TAC AAC CCC 428 
Lys Gin Ser Met Ala Gin Arg Ala Arg Thr Met Ala Leu Tyr Asn Pro 
50 55 60 

ATC CCC GTC CGA CAG AAC TGC CTC ACG GTT AAC CGG TCT CTC TTC CTC 476 
He Pro Val Arg Gin Asn Cys Leu Thr Val Asn Arg Ser Leu Phe Leu 

70 75 80 

TTC AGC GAA GAC AAC GTG GTG AGA AAA TAC GCC AAA AAG ATC ACC GAA 524 
Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys Lys He Thr Glu 
85 90 95 

TGG CCT CCC TTT GAA TAT ATG ATT TTA GCC ACC ATC ATA GCG AAT TGC 572 
Trp Pro Pro Phe Glu Tyr Met He Leu Ala Thr He He Ala Asn Cys 
100 105 no 

ATC GTC CTC GCA CTG GAG CAG CAT CTG CCT GAT GAT GAC AAG ACC CCG 620 
He Val Leu Ala Leu Glu Gin His Leu Pro Asp Asp Asp Lys Thr Pro 
115 120 125 

ATG TCT GAA CGG CTG GAT GAC ACA GAA CCA TAC TTC ATT GGA ATT TTT 668 
Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe He Gly He Phe 
130 135 140 

TGT TTC GAG GCT GGA ATT AAA ATC ATT GCC CTT GGG TTT GCC TTC CAC 716 
Cys Phe Glu Ala Gly He Lys He He Ala Leu Gly Phe Ala Phe His 
145 150 155 160 

AAA GGC TCC TAC TTG AGG AAT GGC TGG AAT GTC ATG GAC TTT GTG GTG 764 
Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met Asp Phe Val Val 
165 170 175 

GTG CTA ACG GGC ATC TTG GCG ACA GTT GGG ACG GAG TTT GAC CTA CGG 812 
Val Leu Thr Gly He Leu Ala Thr Val Gly Thr Glu Phe Asp Leu Arg 
180 185 190 

ACG CTG AGG GCA GTT CGA GTG CTG CGG CCG CTC AAG CTG GTG TCT GGA 860 
Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys Leu Val Ser Gly 
195 200 205 

ATC CCA AGT TTA CAA GTC GTC CTG AAG TCG ATC ATG AAG GCG ATG ATC 908 
He Pro Ser Leu Gin Val Val Leu Lys Ser He Met Lys Ala Met He 
210 215 220 

CCT TTG CTG CAG ATC GGC CTC CTC CTA TTT TTT GCA ATC CTT ATT TTT Q«;#; 
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Pro Leu l.eu Gin He Gly Leu Leu Leu Phe Phe Ala He Leu He Phe 
225 230 235 240 

GCA ATC ATA GGG TTA GAA TTT TAT ATG GGA AAA TTT CAT ACC ACC TGC 1004 
Ala He He Gly Leu Glu Phe Tyr Met Gly Lys Phe His Thr Thr Cys 
245 250 255 

^ ^ 2?^ ^'^ TOT CCG GCT CCA TGT 1052 

Phe Glu Glu Gly Thr Asp Asp He Gin Gly Glu Ser Pro Ala Pro Cvs 
260 265 270 

t?^^^^ CCC AAT GGG ACC AAA TGT CAG 1100 

Gly Thr Glu Glu Pro Ala Arg Thr Cys Pro Asn Gly Thr Lys Cys Gin 
275 280 285 

CCC TAC TGG GAA GGG CCC AAC AAC GGG ATC ACT CAG TTC GAC AAC ATC 1148 
Pro Tyr Trp Glu Gly Pro Asn Asn Gly He Thr Gin Phe Asp Asn He 
290 295 300 

CTG TTT GCA GTG CTG ACT GTT TTC CAG TGC ATA ACC ATG GAA GGG TGG 1196 
Leu Phe Ala Val Leu Thr Val Phe Gin Cys He Thr Met Glu Gly Trp 

310 315 320 

ACT GAT CTC CTC TAC AAT AGC AAC GAT GCC TCA GGG AAC ACT TGG AAC 1244 
Thr Asp Leu Leu Tyr Asn Ser Asn Asp Ala Ser Gly Asn Thr Trp Asn 
325 330 335 

TGG TTG TAC TTC ATC CCC CTC ATC ATC ATC GGC TCC TTT TTT ATG CTG 1292 
Trp Leu Tyr Phe He Pro Leu He He He Gly Ser Phe Phe Met Leu 
340 345 350 

AAC CTT GTG CTG GGT GTG CTG TCA GGG GAG TTT GCC AAA GAA AGG GAA 1340 
Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys Glu Arg Glu 
355 360 365 

CGG GTG GAG AAC CGG CGG GCT TTT CTG AAG CTG AGG CGG CAA CAA CAG 1388 
Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg Gin Gin Gin 
370 375 380 

ATT GAA CGT GAG CTC AAT GGG TAC ATG GAA TGG ATC TCA AAA GCA GAA 1436 
He Glu Arg Glu Leu Asn Gly Tyr Met Glu Trp He Ser Lys Ala Glu 
385 390 395 400 

GAG GTG ATC CTC GCC GAG GAT GAA ACT GAC GGG GAG CAG AGG CAT CCC 1484 
Glu Val He Leu Ala Glu Asp Glu Thr Asp Gly Glu Gin Arg His Pro 
405 410 415 

TTT GAT GGA GCT CTG CGG AGA ACC ACC ATA AAG AAA AGC AAG ACA GAT 1532 
Phe Asp Gly Ala Leu Arg Arg Thr Thr He Lys Lys Ser Lys Thr Asp 
420 425 430 

TTG CTC AAC CCC GAA GAG GCT GAG GAT CAG CTG GCT GAT ATA GCC TCT 1580 
Leu Leu Asn Pro Glu Glu Ala Glu Asp Gin Leu Ala Asp He Ala Ser 
435 440 445 

GTG GGT TCT CCC TTC GCC CGA GCC AGC ATT AAA AGT GCC AAG CTG GAG 1628 
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val Gly Ser Pro Phe Ala Arg Ala Ser He Lys Ser Ala Lys Leu Glu 



460 



^ ^0 lAG GAG AGG AGG ATG CGT TTC TAC ATC 

Asn ser Thr Phe Phe His Lys Lys Glu Arg Arg Met Arg Se lie 

^70 475 ' 

CGC CGC ATG GTC AAA ACT CAG GCC TTC TAC TOG ACT GTA CTC AGT TTC 
Arg Arg Met Val Lys Thr Gin Ala Phe Tyr Trp Thr Val Leu S« ™ 
485 490 

OTA GCT CTC AAC ACG CTG TGT GTT GCT ATT GTT CAC TAC AAC CAG CCC 
Val Ala Leu Asn Thr Leu Cys Val Ala He Val His Tyr Asn Gin Pro 
500 505 510 

^ ^ f""" ^ '^^^ GCA GAA TTC ATT TTC TTA GGA 

Glu Trp Leu Ser Asp Phe Leu Tyr Tyr Ala Glu Phe He Phe Leu Gly 
515 520 525 ^ 

fl?. 2^ GAA ATG TTT ATA AAA ATC TAC GGG CTT GGG ACG CGG 

Leu Phe Met Ser Glu Met Phe He Lys Met Tyr Gly Leu Gly Thr tog 
"0 535 3 

CCT TAC TTC CAC TCT TCC TTC AAC TCC TTT GAC TOT GGG GTT ATC ATT 
Pro Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Cys Gly Val He He 

550 555 

GGG AGC ATC TTC GAG GTC ATC TGG GCT GTC ATA AAA CCT GGC ACA TCC 
Gly ser He Phe Glu Val He Trp Ala Val He Lys Pro Gly Tte 
565 570 575 

TTT GGA ATC AGC GTC TTA CGA GCC CTC AGG TTA TTC CGT ATT TTC AAA 
Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arg He Phe Lys 
580 585 590 

if? ^ AAG TAC TGG GCA TCT CTC AGA AAC CTC GTC GTC TCT CTC CTC 
Val Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser Leu Leu 
595 600 605 

AAC TCC ATC AAG TCC ATC ATC AGC CTC TTC TTT CTC CTT TTC CTC TTC 
Asn Ser Met Lys Ser He He Ser Leu Leu Phe Leu Leu Phe Leu Phe 

615 620 

AIT GTC GTC TTC GCC CTT TTC GGA ATC CAA CTC TTC GGC GGC CAG TTT 2156 
He Val Val Phe Ala Leu Leu Gly Met Gin Leu Phe Gly Gly Gin Phe 
-^25 630 635 640 

AAT TTC GAT GAA GGG ACT CCT CCC ACC AAC TTC GAT ACT TTT CCA GCA 
Asn Phe Asp Glu Gly Thr Pro Pro Thr Asn Phe Asp Thr Phe Pro Ala 
645 650 655 

GCA ATA ATC ACG GTC TTT CAG ATC CTG ACG GGC GAA GAC TGG AAC GAG 2252 
Ala He Met Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp Asn Glu 
660 665 670 

GTC ATG TAC GAC GGG ATC AAG TCT CAG GGG GGC GTG CAG GGC GGC ATG o^nn 



1676 



1724 



1772 



1820 



186B 



1916 



1964 



2012 



2060 



2108 



2204 
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Val Met Tyr Asp Gly lie Lys Ser Gin Gly Gly Val Gin Gly Gly Met 
675 680 685 

GTG TTC TCC ATC TAT TTC ATT GTA CTG ACG CTC TTT GGG AAC TAC ACC 2348 
Val Phe Ser lie Tyr Phe lie Val Leu Thr Leu Phe Gly Asn Tyr Thr 
S90 695 700 

CTC CTG AAT GTG TTC TTG GCC ATC GCT GTG GAG AAT CTG GCC AAC GCC 2396 
Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala Asn Ala 
705 710 715 720 

CAG GAG CTC ACC AAG GTG GAG GCG GAC GAG CAA GAG GAA GAA QAA GCA 2444 
Gin Glu Leu Thr Lys Val Glu Ala Asp Glu Gin Glu Glu Glu Glu Ala 
725 730 735 

GCG AAC CAG AAA CTT GCC CTA CAG AAA GCC AAG GAG GTG GCA GAA GTG 2492 
Ala Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu Val 
740 745 750 

AGT CCT CTG TCC GCG GCC AAC ATG TCT ATA GCT GTG AAA GAG CAA CAG 2540 
Ser Pro Leu Ser Ala Ala Asn Met Ser He Ala Val Lys Glu Gin Gin 
755 760 765 

AAG AAT CAA AAG CCA GCC AAG TCC GTG TGG GAG CAG CGG ACC AGT GAG 2588 
Lys Asn Gin Lys Pro Ala Lys Ser Val Trp Glu Gin Arg Thr Ser Glu 
770 775 780 

ATG CGA AAG CAG AAC TTG CTG GCC AGC CGG GAG GCC CTG TAT AAC GAA 2636 
Met Arg Lys Gin Asn Leu Leu Ala Ser Arg Glu Ala Leu Tyr Asn Glu 
785 790 795 800 

ATG GAC CCG GAC GAG CGC TGG AAG GCT GCC TAC ACG CGG CAC CTG CGG 2684 
Met Asp Pro Asp Glu Arg Trp Lys Ala Ala Tyr Thr Arg His Leu Arg 
805 810 815 

CCA GAC ATG AAG ACG CAC TTG GAC CGG CCG CTG GTG GTC GAC CCG CAG 2732 
Pro Asp Met Lys Thr His Leu Asp Arg Pro Leu Val Val Asp Pro Gin 
820 825 830 

GAG AAC CGC AAC AAC AAC ACC AAC AAG AGC CGG GCG GCC GAG CCC ACC 2780 
Glu Asn Arg Asn Asn Asn Thr Asn Lys Ser Arg Ala Ala Glu Pro Thr 
835 840 845 

GTG GAC CAG CGC CTC GGC CAG CAG CGC GCC GAG GAC TTC CTC AGG AAA 2828 
Val Asp Gin Arg Leu Gly Gin Gin Arg Ala Glu Asp Phe Leu Arg Lys 
850 855 860 

CAG GCC CGC TAC CAC GAT CGG GCC CGG GAC CCC AGC GGC TCG GCG GGC 2876 
Gin Ala Arg Tyr His Asp Arg Ala Arg Asp Pro Ser Gly Ser Ala Gly 
865 870 875 880 

CTG GAC GCA CGG AGG CCC TGG GCG GGA AGC CAG GAG GCC GAG CTG AGC 2924 
Leu Asp Ala Arg Arg Pro Trp Ala Gly Ser Gin Glu Ala Glu Leu Ser 
885 890 895 

CGG GAG GGA CCC TAC GGC CGC GAG TCG GAC CAC CAC GCC CGG GAG GGC 2972 
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Arg Glu Gly Pro Tyr Gly Arg Glu Ser Asp His His Ala Arg Glu Gly 
900 905 910 

AGC CTG GAG CAA CCC GGG TTC TGG GAG GGC GAG GCC GAG CGA GGC AAG 3020 
Ser heu Glu Gin Pro Gly Phe Trp Glu Gly Glu Ala Glu Arg Gly Lys 
915 920 925 

GCC GGG GAC CCC CAC CG6 AGG CAC GTG CAC CGG CAG GGG GGC AGC AGG 3068 
Ala Gly Asp Pro His Arg Arg His Val His Arg Gin Gly Gly Ser Arg 
930 935 940 

GAG AGC C6C AGC GGG TCC CC6 CGC ACQ GGC GCG GAC GGG GAG CAT CGA 3116 
Glu Ser Arg Ser Gly Ser Pro Arg Thr Gly Ala Asp Gly Glu His Arg 
945 950 955 960 

OQT CAT CGC GCG CAC CGC AGG CCC GGG GAG GAG GGT CC6 G7^ GAC AAG 3164 
Arg His Arg Ala His Arg Arg Pro Gly Glu Glu Gly Pro Glu Asp Lys 
965 970 975 

GCG GAG CGG AGG GCQ CGG CAC CGC GAG GGC AGC CGG CCG GCC CGG GGC 3212 
Ala Glu Arg Arg Ala Arg His Arg Glu Gly Ser Arg Pro Ala Arg Gly 
980 985 990 

GGC GAG GGC GAG GGC GAG GGC CCC GAC GGG GGC GAG CGC AGG AGA AGG 3260 
Gly Glu Gly Glu Gly Glu Gly Pro Asp Gly Gly Glu Arg Arg Arg Arg 
995 1000 1005 

CAC CGG CAT GGC GCT CCA GCC ACG TAC GAG GGG GAC GCG CGG AGG GAG 3308 
His Arg His Gly Ala Pro Ala Thr Tyr Glu Gly Asp Ala Arg Arg Glu 
1010 1015 1020 

GAC AAG GAG CGG AGG CAT CGG AGG AGG AAA GAG AAC CAG GGC TCC GGG 3356 
Asp Lys Glu Arg Arg His Arg Arg Arg Lys Glu Asn Gin Gly Ser Gly 
1025 1030 1035 1040 

GTC CCT GTG TCG GGC CCC AAC CTG TCA ACC ACC CGG CCA ATC CAG CAG 3404 
Val Pro Val Ser Gly Pro Asn Leu Ser Thr Thr Arg Pro He Gin Gin 
1045 1050 1055 

GAC CTG GGC CGC CAA GAC CCA CCC CTG GCA GAG GAT ATT GAC AAC ATG 3452 
Asp Leu Gly Arg Gin Asp Pro Pro Leu Ala Glu Asp He Asp Asn Met 
1060 1065 1070 

AAG AAC AAC AAG CTG GCC ACC GCG GAG TCG GCC GCT CCC CAC GGC AGC 3500 
Lys Asn Asn Lys Leu Ala Thr Ala Glu Ser Ala Ala Pro His Gly Ser 
1075 1080 1085 

CTT GGC CAC GCC GGC CTG CCC CAG AGC CCA GCC AAG ATG GGA AAC AGC 3548 
Leu Gly His Ala Gly Leu Pro Gin Ser Pro Ala Lys Met Gly Asn Ser 
1090 1095 1100 

ACC GAC CCC GGC CCC ATG CTG GCC ATC CCT GCC ATG GCC ACC AAC CCC 3596 
Thr Asp Pro Gly Pro Met Leu Ala He Pro Ala Met Ala Thr Asn Pro 
1105 1110 1115 1120 

CAG AAC GCC GCC AGC CGC CGG ACG CCC AAC AAC CCG GOn ARC CCA TPC -^fiAA 
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Gin Asn Ala Ala Ser Arg Arg Thr Pro Asn Asn Pro Gly Asn Pro Ser 
1125 1130 1135 

AAT CCC GGC CCC CCC AAG ACC CCC GAG MT AGC CTT ATC GTC ACC AAC 
Asn Pro Gly Pro Pro Lys Thr Pro Glu Asn Ser Leu lie Val Thr Asn 
1140 1145 1150 

CCC AGC GGC ACC CAG ACC AAT TCA GCT AAG ACT GCC AGG AAA CCC GAC 
Pro Ser Gly Thr Gin Thr Asn Ser Ala Lys Thr Ala Arg Lys Pro Asp 
1155 1160 • 1165 

CAC ACC ACA GTG GAC ATC CCC CCA GCC TGC CCA CCC CCC CTC AAC CAC 
His Thr Thr Val Asp He Pro Pro Ala Cys Pro Pro Pro Leu Asn His 
1170 1175 1180 



ACC GTC GTA CAA GTG AAC AAA AAC GCC AAC CCA GAC CCA CTG CCA AAA 
Thr Val Val Gin Val Asn Lys Asn Ala Asn Pro Asp Pro Leu Pro Lys 
1185 1190 1195 iJoO 

AAA GAG GAA GAG AAG AAG GAG GAG GAG GAA GAC GAC CGT GGG GAA GAC 
Lys Glu Glu Glu Lys Lys Glu Glu Glu Glu Asp Asp Arg Gly Glu Asp 
1205 1210 1215 

GGC CCT AAG CCA ATG CCT CCC TAT AGC TCC ATG TTC ATC CTG TCC ACG 
Gly Pro Lys Pro Met Pro Pro Tyr Ser Ser Met Phe He Leu Ser Thr 
1220 1225 1230 

ACC AAC CCC CTT CGC CGC CTG TGC CAT TAC ATC CTG AAC CTG CGC TAC 
Thr Asn Pro Leu Arg Arg Leu Cys His Tyr He Leu Asn Leu Arg Tvr 
1235 1240 1245 

TTT GAG ATG TGC ATC CTC ATG GTC ATT GCC ATG AGC AGC ATC GCC CTG 
Phe Glu Met Cys He Leu Met Val He Ala Met Ser Ser He Ala Leu 
1250 1255 1260 

GCC GCC GAG GAC CCT GTG CAG CCC AAC GCA CCT CGG AAC AAC GTG CTG 
Ala Ala Glu Asp Pro Val Gin Pro Asn Ala Pro Arg Asn Asn Val Leu 
1265 1270 1275 1280 

CGA TAC TTT GAC TAC GTT TTT ACA GGC GTC TTC ACC TTT GAG ATG GTG 
Arg Tyr Phe Asp Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val 
1285 1290 1295 

ATC AAG ATG ATT GAC CTG GGG CTC GTC CTG CAT CAG GGT GCC TAC TTC 
He Lys Met He Asp Leu Gly Leu Val Leu His Gin Gly Ala Tyr Phe 
1300 1305 1310 

CGT GAC CTC TGG AAT ATT CTC GAC TTC ATA GTG GTC AGT GGG GCC CTG 
Arg Asp Leu Trp Asn He Leu Asp Phe He Val Val Ser Gly Ala Leu 
1315 1320 1325 

GTA GCC TTT GCC TTC ACT GGC AAT AGC AAA GGA AAA GAC ATC AAC ACG 
Val Ala Phe Ala Phe Thr Gly Asn Ser Lys Gly Lys Asp He Asn Thr 
1330 1335 1340 



3692 



3740 



3788 



3836 



3884 



3932 



3980 



4026 



4076 



4124 



4172 



4220 



4268 



ATT AAA TCC CTC CGA GTC CTC CGG GTG CTA CGA CCT CTT AAA ACC ATC 



4316 
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?if Arg Val Leu Arg Pro Leu Lys Thr lie 

"50 1355 1360 

AAG CGG CTG CCA AAG CTC AAG GOT GTG TTT GAC TGT GTG GTG AAC TCA 
Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val Asn Ser 
1365 1370 1375 

CTT AAA AAC GTC TTC AAC ATC CTC ATC GTC TAC ATG CTA TTC ATG TTC 
Leu Lys Asn Val Phe Asn He Leu He Val Tyr Met Leu Phe Met Phe 

1385 1390 

ATC TTC GCC GTG GTG GCT GTG CAG CTC TTC AAG GGG AAA TTC TTC CAC 
He Phe Ala Val Val Ala Val Gin Leu Phe Lys 61y Lys Phe Phe His 
"35 1400 1405 

^ ^ TTT GAG AAA GAT TGT CQA GGC AAA TAC 

3^n^P ^^"^ ^^"^ Lys Asp Cys Arg Gly Lys TVr 

J-^-^O 1415 1420 

I^^ ^ ^ ^ GTG AAG GCG CGA GAC CGG GAG TGG ^ 

Leu Leu Tyr Glu Lys Asn Qlu Val Lys Ala Arg Asp Arg Glu ^ L^ 

"30 1435 1440 

AAG TAT GAA TTC CAT TAC GAC AAT GTG CTG TGG GCT CTG CTG ACC CTC 
Lys Tyr Glu Phe His Tyr Asp Asn Val Leu Trp Ala Leu Leu Thr Leu 
1445 1450 1455 

TTC ACC GTG TCC ACG GGA GAA GGC TGG CCA CAG GTC CTC AAG CAT TCG 
Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Gin Val Leu Lys His Ser 

1465 1470 

CTG GAC GCC ACC TTT GAG AAC CAG GGC CCC AGC CCC GGG TAC CGC ATG 
Val Asp Ala Thr Phe Glu Asn Gin Gly Pro Ser Pro Gly Tyr Arg Met 
1475 1480 1485 

GAG ATG TCC ATT TTC TAC GTC GTC TAC TTT GTG GTG TTC CCC TTC TTC 
^^"^ Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe 

1490 1495 2^500 

TTT GTC AAT ATC TTT GTG GCC TTG ATC ATC ATC ACC TTC CAG GAG CAA 

lie Il« Thr Phe Gin Glu Gin 
"05 1510 1515 1520 

GGG GAC AAG ATG ATG GAG GAA TAC AGC CTC GAG AAA AAT GAG AGG GCC 
Gly Asp Lys Met Mat Glu Glu Tyr Ser Leu Glu Lys Asn Glu Arg Ala 
1525 1530 1535 

TCC ATT GAT TTC GCC ATC AGC GCC AAG CCG CTG ACC CGA CAC ATC CCG 
Cys He Asp Phe Ala He Ser Ala Lys Pro Leu Thr Arg His Met Pro 
1540 1545 1550 



4364 



4412 



4460 



4508 



4556 



4604 



4652 



4700 



4748 



4796 



4844 



4892 



CAG AAC AAG CAG AGC TTC CAG TAC CGC ATC TGG CAG TTC GTC GTC TCT 
Gin Asn Lys Gin Ser Phe Gin Tyr Arg Met Trp Gin Phe Val Val Ser 
1555 1560 1565 



4940 



CCG CCT TTC GAG TAC ACG ATC ATC GCC ATG ATC GCC CTC AAC ACC ATC 



4988 
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Pro Pro Phe Glu Tyr Thr He Met Ala Met He Ala Leu Asn Thr He 
1570 1575 1580 

GTG CTT ATG ATG AAG TTC TAT GGG GCT TCT GTT GCT TAT GAA AAT QCC 5036 
Val Leu Met Met Lys Phe Tyr Gly Ala Ser Val Ala Tyr Glu Asn Ala 
1585 1590 1595 1600 

CTG CGG GTG TTC AAC ATC GTC TTC ACC TCC CTC TTC TCT CT6 GAA TGT 5084 
Leu Arg Val Phe Asn He Val Phe Thr Ser Leu Phe Ser Leu Glu Cys 
1605 1610 1615 

GTG CTG AAA GTC ATG GCT TTT GGG ATT CTG AAT TAT TTC CGC GAT GCC 5132 
Val Leu Lys Val Met Ala Phe Gly He Leu Asn Tyr Phe Arg Asp Ala 
1620 1625 1630 

TGG AAC ATC TTC GAC TTT GTG ACT GTT CTG GGC A6C ATC ACC GAT ATC 5180 
Trp Asn He Phe Asp Phe Val Thr Val Leu Gly Ser He Thr Asp He 
1635 1640 1645 

CTC GTG ACT GAG TTT GG6 AAT CCG AAT AAC TTC ATC AAC CTG AGC TTT 5228 
Leu Val Thr Glu Phe Gly Asn Pro Asn Asn Phe He Asn Leu Ser Phe 
1650 1655 1660 

CTC CGC CTC TTC CGA GCT GCC CGG CTC ATC AAA CTT CTC CGT CAG GGT 5276 
Leu Arg Leu Phe Arg Ala Ala Arg Leu He Lys Leu Leu Arg Gin Gly 
1665 1670 1675 1680 

TAC ACC ATC CGC ATT CTT CTC TGG ACC TTT GTG CAG TCC TTC AAG GCC 5324 
Tyr Thr He Arg He Leu Leu Trp Thr Phe Val Gin Ser Phe Lys Ala 
1685 1690 1695 

CTG CCT TAT GTC TGT CTG CTG ATC GCC ATG CTC TTC TTC ATC TAT GCC 5372 
Leu Pro Tyr Val Cys Leu Leu He Ala Met Leu Phe Phe He Tyr Ala 
1700 1705 1710 

ATC ATT GGG ATG CAG GTG TTT GGT AAC ATT GGC ATC GAC GTG GAG GAC 5420 
He He Gly Met Gin Val Phe Gly Asn He Gly He Asp Val Glu Asp 
1715 1720 1725 

GAG GAC AGT GAT GAA GAT GAG TTC CAA ATC ACT GAG CAC AAT AAC TTC 5468 
Glu Asp Ser Asp Glu Asp Glu Phe Gin He Thr Glu His Asn Asn Phe 
1730 1735 1740 

CGG ACC TTC TTC CAG GCC CTC ATG CTT CTC TTC CGG AGT GCC ACC GGG 5516 
Arg Thr Phe Phe Gin Ala Leu Met Leu Leu Phe Arg Ser Ala Thr Gly 
1745 1750 1755 1760 

GAA GCT TGG CAC AAC ATC ATG CTT TCC TGC CTC AGC GGG AAA CCG TGT 5564 
Glu Ala Trp His Asn He Met Leu Ser Cys Leu Ser Gly Lys Pro Cys 
1765 1770 1775 

GAT AAG AAC TCT GGC ATC CTG ACT CGA GAG TGT GGC AAT GAA TTT GCT 5612 
Asp Lys Asn Ser Gly He Leu Thr Arg Glu Cys Gly Asn Glu Phe Ala 
1780 1785 1790 

TAT TTT TAC TTT GTT TCC TTC ATC TTC CTC TGC TCG TTT CTG ATG CTG R660 
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Tyr Phe Tyr Phe Val Ser Phe He Phe Leu Cys Ser Phe Leu Met Leu 
1795 1800 1805 

AAT CTC TTT GTC GCC GTC ATC ATG GAG AAC TTT GAG TAG CTC ACC CGA 5708 
Asn Leu Phe Val Ala Val He Met Asp Asn Phe Glu Tyr Leu Thr Aro 
1810 1815 1820 

GAC TCC TCC ATC CTG QGC CCC CAC CAC CTG GAT GAG TAG GTG CGT GTC 5756 
Asp Ser Ser He Leu Gly Pro His His Leu Asp Glu Tyr Val Arg Val 
^^25 1830 1835 1840 

TGG GCC GAG TAT GAC CCC GCA GCT TGG GGC C6C ATG CCT TAC CTG GAC 5804 
Trp Ala Glu Tyr Asp Pro Ala Ala Trp Gly Arg Met Pro Tyr Leu Asp 
1845 1850 1855 

ATG TAT CAG ATG CTG AGA CAC ATG TCT CCG CCC CTG GGT CTG GGG AAG 5852 
Met Tyr Gin Met Leu Arg His Met Ser Pro Pro Leu Gly Leu Gly Lys 
I860 1865 1870 

AAG TGT CCG GCC AGA GTG GCT TAC AAG CGG CTT CTG CGG ATG GAC CTG 5900 
Lys Cys Pro Ala Arg Val Ala Tyr Lys Arg Leu Leu Arg Met Asp Leu 
1875 1880 1885 

CCC GTC GCA GAT GAC AAC ACC GTC CAC TTC AAT TCC ACC CTC ATG GCT 5948 
Pro Val Ala Asp Asp Asn Thr Val His Phe Asn Ser Thr Leu Met Ala 
1890 1895 1900 

CTG ATC CGC ACA GCC CTG GAC ATC JkAG ATT GCC AAG GGA GGA GCC GAC 5996 
Leu He Arg Thr Ala Leu Asp He Lys He Ala Lys Gly Gly Ala Asp 
1^05 1910 1915 i^lo 

AAA CAG CAG ATG GAC GCT GAG CTG CGG AAG GAG ATG ATG GCG ATT TGG 6044 
Lys Gin Gin Met Asp Ala Glu Leu Arg Lys Glu Met Met Ala He Trp 
1925 1930 1935 

CCC AAT CTG TCC CAG AAG ACG CTA GAC CTG CTC GTC ACA CCT CAC AAG 6092 
Pro Asn Leu Ser Gin Lys Thr Leu Asp Leu Leu Val Thr Pro His Lys 
1940 1945 1950 

TCC ACG GAC CTC ACC GTG GGG AAG ATC TAC GCA GCC ATG ATG ATC ATG 6140 
Ser Thr Asp Leu Thr Val Gly Lys He Tyr Ala Ala Met Met He Met 
1955 I960 1965 

GAG TAC TAC CGG CAG AGC AAG GCC AAG AAG CTG CAG GCC ATG CGC GAG 6188 
Glu Tyr Tyr Arg Gin Ser Lys Ala Lys Lys Leu Gin Ala Met Arg Glu 
1970 1975 1980 

GAG CAG GAC CGG ACA CCC CTC ATG TTC CAG CGC ATG GAG CCC CCG TCC 6236 
Glu Gin Asp Arg ar Pro Leu Met Phe Gin Arg Met Glu Pro Pro Ser 
1985 1990 X995 2000 

CCA ACG CAG GAA GGG GGA CCT GGC CAG AAC GCC CTC CCC TCC ACC CAG 6284 
Pro Thr Gin Glu Gly Gly Pro Gly Gin Asn Ala Leu Pro Ser Thr Gin 
2005 2010 2015 



CTG GAC CCA GGA GGA GCC CTG ATG GCT CAC GAA AGC GGC CTC AAG GAG 



6332 
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Leu Asp Pro Gly Gly Ala Leu Met Ala His Glu Ser Gly Leu Lys Glu 
2020 2025 2030 

AGC CCG TCC TGG GTG ACC CAG CGT GCC CAG GAG ATG TTC CAG AAG ACG 6380 
Ser Pro Ser Trp Val Thr Gin Arg Ala Gin Glu Met Phe Gin Lys Thr 
2035 2040 2045 

GGC ACA TGG AGT CCG GAA CAA GGC CCC CCT ACC GAC ATG CCC AAC AGC 6428 
Gly Thr Trp Ser Pro Glu Gin Gly Pro Pro Thr Asp Met Pro Asn Ser 
2050 2055 2060 

CAG CCT AAC TCT CAG TCC GTG GAG ATG CGA GAG ATG GGC AGA GAT GGC 6476 
Gin Pro Asn Ser Gin Ser Val Glu Met Arg Glu Met Gly Arg Asp Gly 
2065 2070 2075 2080 

TAC TCC GAC AGC GAG CAC TAC CTC CCC ATG GAA GGC CAG GGC CGG GCT 6524 
Tyr ser Asp Ser Glu His Tyr Leu Pro Met Glu Gly Gin Gly Arg Ala 
2085 2090 2095 

GCC TCC ATG CCC CGC CTC CCT GCA GAG AAC CAG AGG AGA AGG GGC CGG 6572 
Ala Ser Met Pro Arg Leu Pro Ala Glu Asn Gin Arg Arg Arg Gly Arg 
2100 2105 2110 

CCA CGT GGG AAT AAC CTC AGT ACC ATC TCA GAC ACC AGC CCC ATG AAG 6620 
Pro Arg Gly Asn Asn Leu Ser Thr He Ser Asp Thr Ser Pro Met Lys 
2115 2120 2125 

CGT TCA GCC TCC GTG CTG GGC CCC AAG GCC CGA CGC CTG GAC GAT TAC 6668 
Arg Ser Ala Ser Val Leu Gly Pro Lys Ala Arg Arg Leu Asp Asp Tyr 
2130 2135 2140 

TCG CTG 67^ CGG GTC CCG CCC GAG GAG AAC CAG CGG CAC CAC CAG CGG 6716 
Ser Leu Glu Arg Val Pro Pro Glu Glu Asn Gin Arg His His Gin Arg 
2145 2150 2155 2160 

CGC CGC GAC CGC AGC CAC CGC GCC TCT GAG CGC TCC CTG GGC CGC TAC 6764 
Arg Arg Asp Arg Ser His Arg Ala Ser Glu Arg Ser Leu Gly Arg Tyr 
2165 2170 2175 

ACC GAT GTG GAC ACA GGC TTG GGG ACA GAC CTG AGC ATG ACC ACC CAA 6812 
Thr Asp Val Asp Thr Gly Leu Gly Thr Asp Leu Ser Met Thr Thr Gin 
2180 2185 2190 

TCC GGG GAC CTG CCG TCG AAG GAG CGG GAC CAG GAG CGG GGC CGG CCC 6860 
Ser Gly Asp Leu Pro Ser Lys Glu Arg Asp Gin Glu Arg Gly Arg Pro 
2195 2200 2205 

AAG GAT CGG AAG CAT CGA CAG CAC CAC CAC CAC CAC CAC CAC CAC CAC 6908 
Lys Asp Arg Lys His Arg Gin His His His His His His His His His 
2210 2215 2220 

CAT CCC CCG CCC CCC GAC AAG GAC CGC TAT GCC CAG GAA CGG CCG GAC 6956 
His Pro Pro Pro Pro Asp Lys Asp Arg Tyr Ala Gin Glu Arg Pro Asp 
2225 2230 2235 2240 

CAC GGC CGG GCA CGG GCT CGG GAC CAG CGC TGG TCC CGC TCG CCC AGC 7004 
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His Gly Arg Ala Arg Ala Arg Asp Gin Arg Trp Ser Arg Ser Pro Ser 
2245 2250 2255 

GAG GGC CGA GAG CAC ATG GCG CAC CGG CAG GGC AGT A6T TCC GTA AGT 7052 
Glu Gly Arg Glu His Met Ala His Arg Gin Gly Ser Ser Ser Val Ser 
2260 2265 2270 

GGA AGC CCA GCC CCC TCA ACA TCT GGT ACC AGO ACT CCG OGG CGG GGC 7100 
Gly Ser Pro Ala Pro Ser Thr Ser Gly Thr Ser Thr Pro Arg Arg Gly 
2275 2280 2285 

CGC CGC CAG CTC CCC CAG ACC CCC TCC ACC CCC CGG CCA CAC GTG TCC 7148 
Arg Arg Gin Leu Pro Gin Thr Pro Ser Thr Pro Arg Pro His Val Ser 
2290 2295 2300 

TAT TCC CCT GTG ATC OGT AAG GCC GGC GGC TCG GOG CCC CCG CAG CAG 7196 
Tyr Ser Pro Val lie Arg Lys Ala Gly Gly Ser Gly Pro Pro Gin Gin 
2305 2310 23X5 2320 

CAG CAG CAG CAG CAG CAG CAG CAG CAG GCG GTG GCC AGG CCG GGC CGG 7244 
Gin Gin Gin Gin Gin Gin Gin Gin Gin Ala Val Ala Arg Pro Gly Arg 
2325 2330 2335 

GCG GCC ACC AGC GGC CCT CGG AGG TAG CCA GGC CCC ACG GCC GAG CCT 7292 
Ala Ala Thr Ser Gly Pro Arg Arg Tyr Pro Gly Pro Thr Ala Glu Pro 
2340 2345 2350 

CTG GCC GGA GAT CGG CCG CCC ACG GGG GGC CAC AGC AGC GGC CGC TCG 7340 
Leu Ala Gly Asp Arg Pro Pro Thr Gly Gly His Ser Ser Gly Arg Ser 
2355 2360 2365 

CCC AGG ATG GAG AGG CGG GTC CCA GGC CCG GCC CGG AGC GAG TCC CCC 7388 
Pro Arg Met Glu Arg Arg Val Pro Gly Pro Ala Arg Ser Glu Ser Pro 
2370 2375 2380 

AGG GCC TGT CGA CAC GGC GGG GCC CGG TGG CCG GCA TCT GGC CCG CAC 7436 
Arg Ala Cys Arg His Gly Gly Ala Arg Trp Pro Ala Ser Gly Pro His 
2385 2390 2395 2400 

GTG TCC GAG GGG CCC CCG GGT CCC CGG CAC CAT GGC TAG TAG CGG GGC 7484 
Val Ser Glu Gly Pro Pro Gly Pro Arg His His Gly Tyr Tyr Arg Gly 
2405 2410 2415 

TCC GAG TAG GAG GAG GCC GAT GGC CCG GGC AGC GGG GGC GGC GAG GAG 7532 
Ser Asp Tyr Asp Glu Ala Asp Gly Pro Gly Ser Gly Gly Gly Glu Glu 
2420 2425 2430 

GCC ATG GCC GGG GCC TAG GAG GCG CGA CCC CCC GTA CGA CAC GCG TCC 7580 
Ala Met Ala Gly Ala Tyr Asp Ala Pro Pro Pro Val Arg His Ala Ser 
2435 2440 2445 

TGG GGC GGC AGC GGG CGC TCG GCC AGG ACT CCC CGG GCC TCG GGC GGG 7628 
Ser Gly Ala Thr Gly Arg Ser Pro Arg Thr Pro Arg Ala Ser Gly Pro 
2450 2455 2460 



GGC TGC GGG TCG CCT TCT CGG CAC GGC COn rria CTr rrc AAP nnr trh 
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Ala Cys Ala Ser Pro Ser Arg His Gly Arg Arg Leu Pro Asn Gly Tyr 
2465 2470 2475 2480 

TAC CCG GC6 CAC GGA CTG GCC AGG CCC CGC GGG CCG GGC TCC AG6 AA8 7724 
Tyr Pro Ala His Gly I*eu Ala Arg Pro Arg Gly Pro Gly Ser Arg Lys 
2485 2490 2495 

GGC CTG CAC GAA CCC TAC AGC GAG AGT GAC GAT GAT TGG TGC TAAGCCCGGG 7776 
Gly Leu His Glu Pro Tyr Ser Glu Ser Asp Asp Asp Trp Cys 
2500 2505 2510 

CGAGQTGGCG CCCGCCCGGC CCCCCAC6CA CC 7808 

(2) INFORMATION FOR SEQ ID N0:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLBCDLE TYPE: DNA (genomic) 



(ix) FBATORE: 

(A) NAME/KEY: CDS 

(B) LOCAT ION: 237.. 7037 

(D) OTHER INFORMATION: /standard_naiiie'= "Alpha -lA- 2" 



(xi) SEQDENCE DESCRIPTION: SEQ ID NO: 23: 

GATGTCCCGA GCTGCTATCC CCGGCTCGGC CC6G6CA6CC 6CCTTCTGAG CCCCCGACCC 60 

GAGGC6CCGA GCCGCCGCCO CCCGATGGGC TGGGCC6TGG AGCGTCTCCG CAGTCGTAGC 120 

TCCAGCCGCC GCGCTCCCAG CCCCGGCA6C CTCAGCATCA GCGGCGGCGG CGGCGGCGGC 180 

GGCGTCTTCC GCATCGTTCG CCGCA6G6TA ACCCGGAGCC CTTTGCTCTT TGCAGA 236 

ATG GCC CGC TTC GGA GAC GAG ATG CCG GCC CGC TAC GGG GGA GGA GGC 284 
Met Ala Arg Phe Gly Asp Glu Met Pro Ala Arg Tyr Gly Gly Gly Gly 
15 10 15 

TCC GGG GCA GCC GCC GGG GTG GTC GTG GGC AGC GGA GGC GGG CGA GGA 332 
Ser Gly Ala Ala Ala Gly Val Val Val Gly Ser Gly Gly Gly Arg Gly 
20 25 30 

GCC GGG GGC AGC CGG CAG GGC GGG CAG CCC GGG GC6 CAA AGG ATG TAC 380 
Ala Gly Gly Ser Arg Gin Gly Gly Gin Pro Gly Ala Gin Arg Met Tyr 
35 40 45 

AAG CAG TCA ATG GCG CAG AGA GCG CGG ACC ATG GCA CTC TAC AAC CCC 428 
Lys Gin Ser Met Ala Gin Arg Ala Arg Thr Met Ala Leu Tyr Asn Pro 
50 55 60 
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ATC CCC GTC CGA CAG AAC TGC CTC ACQ GTT AAC CGG TCT CTC TTC CTC 476 
lie Pro Val Arg Gin Asn Cys Leu T!hr Val Asn Arg Ser Leu Phe Leu 
^5 70 75 80 

TTC AGC GAA GAC AAC GTG GTG AGA AAA TAC GCC AAA AAG ATC ACC GAA 524 
Phe Ser Glu Asp Asn Val Val Arg Lys Tyr Ala Lys Lys He Thr Glu 
85 90 95 

TGG CCT CCC TTT GAA TAT ATG ATT TTA GCC ACC ATC ATA GCG AAT TGC 572 
Tip Pro Pro Phe Glu Tyr Met He Leu Ala Thr He He Ala Asn Cys 
100 105 110 

ATC GTC CTC 6CA CTG GAG GAG CAT CTG CCT GAT GAT GAC AAG ACC CCG 620 
He Val Leu Ala Leu Glu Gin His Leu Pro Asp Asp Asp Lys Thr Pro 
115 120 125 

ATG TCT GAA CGG CTG GAT GAC ACA GAA CCA TAC TTC ATT GGA ATT TTT 668 
Met Ser Glu Arg Leu Asp Asp Thr Glu Pro Tyr Phe He Gly He Phe 
130 135 140 

TGT TTC GAG GCT GGA ATT AAA ATC ATT GCC CTT GGG TTT GCC TTC CAC 716 
Cys Phe Glu Ala Gly He Lys He He Ala Leu Gly Phe Ala Phe His 
"5 150 155 160 

AAA GGC TCC TAC TTG AGG AAT GGC TGG AAT GTC ATG GAC TTT GTG GTG 764 
Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn Val Met Asp Phe Val Val 
165 170 175 

GTG CTA ACQ GGC ATC TTG GCG ACA GTT GGG ACG GAG TTT GAC CTA CGG 812 
Val Leu Thr Gly He Leu Ala Thr Val Gly Thr Glu Phe Asp Leu Arg 
180 185 190 

ACG CTG AGG GCA GTT CGA GTG CTG CGG CCG CTC AAG CTG GTG TCT GGA 860 
Thr Leu Arg Ala Val Arg Val Leu Arg Pro Leu Lys Leu Val Ser Gly 
195 200 205 

ATC CCA AGT TTA CAA GTC GTC CTG AAG TCG ATC ATG AAG GCG ATG ATC 908 
He Pro Ser Leu Gin Val Val Leu Lys Ser He Met Lys Ala Met He 
210 215 220 

CCT TTG CTG CAG ATC GGC CTC CTC CTA TTT TTT GCA ATC CTT ATT TTT 956 
Pro Leu Leu Gin He Gly Leu Leu Leu Phe Phe Ala He Leu He Phe 
225 230 235 240 

GCA ATC ATA GGG TTA GAA TTT TAT ATG GGA AAA TTT CAT ACC ACC TGC 1004 
Ala He He Gly Leu Glu Phe Tyr Met Gly Lys Phe His Thr Thr Cys 
245 250 255 

TTT GAA GAG GGG ACA GAT GAC ATT CAG GGT GAG TCT CCG GCT CCA TGT 1052 
Phe Glu Glu Gly Thr Asp Asp He Gin Gly Glu Ser Pro Ala Pro Cys 
260 265 270 

GGG ACA GAA GAG CCC GCC CGC ACC TGC CCC AAT GGG ACC AAA TGT CAG 1100 
Gly Thr Glu Glu Pro Ala Arg Thr Cys Pro Asn Gly Thr Lys Cys Gin 
275 280 285 
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CCC TAG TGG GAA GGG CCC AAC AAC GGG ATC ACT CAG TTC GAC AAC ATC 1148 
Pro Tyr Trp Glu Gly Pro Asn Asn Gly lie Thr Gin Phe Asp Asn lie 
290 295 300 

CTG TTT GCA GTG CTG ACT GTT TTC CAG TGC ATA ACC ATG GAA GGG TGG 1196 
Leu Phe Ala Val Leu Thr Val Phe Gin Cys lie Thr Met Glu Gly Trp 

310 315 320 

ACT GAT CTC CTC TAC AAT AGC AAC GAT GCC TCA GGG AAC ACT TGG AAC 1244 
Thr Asp Leu Leu Tyr Asn Ser Asn Asp Ala Ser Gly Asn Thr Trp Asn 
325 330 335 

TGG TTG TAC TTC ATC CCC CTC ATC ATC ATC GGG TCC TTT TTT ATG CTG 1292 
Trp Leu Tyr Phe He Pro Leu He He He Gly Ser Phe Phe Met Leu 
340 345 350 

AAC GTT GTG CTG GGT GTG CTG TCA GGG GAG TTT GCC AAA GAA AGG GAA 1340 
Asn Leu Val Leu Gly Val Leu Ser Gly Glu Phe Ala Lys Glu Arq Glu 
355 360 365 

CGGGTGGAGAACCGGCGGGCTTTTCTGAAGGTGAGGCGGCAACAACAG 1388 
Arg Val Glu Asn Arg Arg Ala Phe Leu Lys Leu Arg Arg Gin Gin Gin 
370 375 380 

ATT GAA CGT GAG CTC AAT GGG TAC ATG GAA TGG ATC TCA AAA GCA GAA 1436 
He Glu Arg Glu Leu Asn Gly Tyr Met Glu Trp He Ser Lys Ala Glu 
385 390 

GAG GTG ATC CTC GCC GAG GAT GAA ACT GAC GGG GAG CAG AGG CAT CCC 1484 
Glu Val He Leu Ala Glu Asp Glu Thr Asp Gly Glu Gin Arg His Pro 
405 410 415 

TTT GAT GGA GCT CTG CGG AGA ACC ACC ATA AAG AAA AGC AAG ACA GAT 1532 
Phe Asp Gly Ala Leu Arg Arg Thr Thr He Lys Lys Ser Lys Thr Asp 
420 425 430 

TTG CTC AAC CCC GAA GAG GCT GAG GAT CAG CTG GCT GAT ATA GCC TCT 1580 
Leu Leu Asn Pro Glu Glu Ala Glu Asp Gin Leu Ala Asp He Ala Ser 
435 440 445 

GTG GGT TCT CCC TTC GCC CGA GCC AGC ATT AAA AGT GCC AAG CTG GAG 1628 
Val Gly Ser Pro Phe Ala Arg Ala Ser He Lys Ser Ala Lys Leu Glu 
450 455 4S0 

AAC TCG ACC TTT TTT CAC AAA AAG GAG AGG AGG ATG CGT TTC TAC ATC 1676 
Asn Ser Thr Phe Phe His Lys Lys Glu Arg Arg Met Arg Phe Tyr He 
4^5 470 475 480 

CGC CGC ATG GTC AAA AGT CAG GCC TTC TAC TGG ACT GTA CTC AGT TTG 1724 
Arg Arg Met Val Lys Thr Gin Ala Phe Tyr Trp Thr Val Leu Ser Leu 
485 490 495 

GTA GGT CTC AAC ACG CTG TGT GTT GCT ATT GTT CAC TAC AAC CAG CCC 1772 
Val Ala Leu Asn Thr Leu Cys Val Ala He Val His Tyr Asn Gin Pro 
500 505 510 
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GAG TGG CTC TCC GAC TTC CTT TAC TAT GCA GAA TTC ATT TTC TTA GGA 1820 
Glu Trp Leu Ser Asp Phe Leu Tyr Tyr Ala Glu Phe lie Phe Leu Gly 
515 520 525 

CTC TTT ATG TCC GAA ATG TTT ATA AAA ATG TAC GGG CTT GGG ACG CGG 1868 
Leu Phe Met Ser Glu Met Phe lie Lys Met Tyr Gly Leu Gly Thr Arq 
530 535 

CCT TAC TTC CAC TCT TCC TTC AAC TGC TTT GAC TGT GGG GTT ATC ATT 1916 
Pro Tyr Phe His Ser Ser Phe Asn Cys Phe Asp Cys Gly Val He He 
545 550 555 560 

GGG AGC ATC TTC GAG GTC ATC TGG GCT GTC ATA AAA CCT GGC ACA TCC 1964 
Gly Ser He Phe Glu Val He Trp Ala Val He Lys Pro Gly Thr Ser 
565 570 575 

TTT GGA ATC AGC GTG TTA CGA GCC CTC AGG TTA TTG CGT ATT TTC AAA 2012 
Phe Gly He Ser Val Leu Arg Ala Leu Arg Leu Leu Arg He Phe Lys 
580 585 590 

GTC ACA AAG TAC TGG GCA TCT CTC AGA AAC CTG GTC GTC TCT CTC CTC 2060 
Val Thr Lys Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser Leu Leu 
595 BOO 605 

AAC TCC ATG AAG TCC ATC ATC AGC CTG TTG TTT CTC CTT TTC CTG TTC 2108 
Asn Ser Met Lys Ser He He Ser Leu Leu Phe Leu Leu Phe Leu Phe 
CIO 615 620 

ATT GTC GTC TTC GCC CTT TTG GGA ATG CAA CTC TTC GGC GGC CAG TTT 2156 
He Val Val Phe Ala Leu Leu Gly Met Gin Leu Phe Gly Gly Gin Phe 
S25 630 635 640 

AAT TTC GAT GAA GGG ACT CCT CCC ACC AAC TTC GAT ACT TTT CCA GCA 2204 
Asn Phe Asp Glu Gly Thr Pro Pro Thr Asn Phe Asp Thr Phe Pro Ala 
645 650 655 

GCA ATA ATG ACG GTG TTT CAG ATC CTG ACG GGC GAA GAC TGG AAC GAG 2252 
Ala He Met Thr Val Phe Gin He Leu Thr Gly Glu Asp Trp Asn Glu 
660 665 670 

GTC ATG TAC GAC GGG ATC AAG TCT CAG GGG GGC GTG CAG GGC GGC ATG 2300 
Val Met Tyr Asp Gly He Lys Ser Gin Gly Gly Val Gin Gly Gly M6t 
675 680 685 

GTG TTC TCC ATC TAT TTC ATT GTA CTG ACG CTC TTT GGG AAC TAC ACC 2348 
Val Phe Ser He Tyr Phe He Val Leu Thr Leu Phe Gly Asn Tyr Thr 
690 695 700 

CTC CTG AAT GTG TTC TTG GCC ATC GCT GTG GAC AAT CTG GCC AAC GCC 2396 
Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn Leu Ala Asn Ala 
705 710 715 720 

CAG GAG CTC ACC AAG GTG GAG 6CG GAC GAG CAA GAG GAA GAA GAA GCA 2444 
Gin Glu Leu Thr Lys Val Glu Ala Asp Glu Gin Glu Glu Glu Glu Ala 
725 730 735 
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GCG AAC CAG AAA CTT GCC CTA CAG AAA GCC AAG GAG GTG GCA GAA GTG 2492 
Ala Asn Gin Lys Leu Ala Leu Gin Lys Ala Lys Glu Val Ala Glu Val 
740 745 750 

AGT CCT CTG TCC GCG GCC AAC ATG TCT ATA GCT GTG AAA GAG CAA CAG 2540 
Ser Pro Leu Ser Ala Ala Asn Met Ser lie iaa Val Lys Glu Oln Gin 
755 760 765 

AAG AAT CAA AAG CCA GCC AAG TCC GTG TGG GAG CAG CG6 ACC AGT GAG 2586 
Lys Asn Gin Lys Pro Ala Lys Ser Val Trp Glu Gin Arg Thr Ser Glu 
770 775 780 

ATG CGA AAG CAG AAC TTG CTG GCC AGC CGG GAG GCC CTG TAT AAC GAA 2636 
Met Arg Lys Gin Asn Leu Leu Ala Ser Arg Glu Ala Leu Tyr Asn Glu 
785 790 795 800 

ATG GAC CCG GAC GAG CGC TGG AAG GCT GCC TAC ACG CGG CAC CTG CGG 2684 
Met Asp Pro Asp Glu Arg Trp Lys Ala Ala Tyr Thr Arg His Leu Arg 
805 810 815 

CCA GAC ATG AAG ACG CAC TTG GAC CGG CCG CTG GTG GTG GAC CCG CAG 2732 
Pro Asp Met Lys Thr His Leu Asp Arg Pro Leu Val Val Asp Pro Gin 
820 825 830 

GAG AAC CGC AAC AAC AAC ACC AAC AAG AGC CGG GCG GCC GAG CCC ACC 2780 
Glu Asn Arg Asn Asn Asn Thr Asn Lys Ser Arg Ala Ala Glu Pro Thr 
835 840 845 

GTG GAC CAG CGC CTC GGC CAG CAG CGC GCC GAG GAC TTC CTC AGG AAA 2826 
Val Asp Gin Arg Leu Gly Gin Gin Arg Ala Glu T^p Phe Leu Arg Lys 
850 855 860 

CAG GCC CGC TAC CAC GAT CGG GCC CGG GAC CCC AGC GGC TCG GCG GGC 2876 
Gin Ala Arg Tyr His Asp Arg Ala Arg Asp Pro Ser Gly Ser Ala Gly 
865 870 875 880 

CTG GAC GCA CGG AGG CCC TGG GCG GGA AGC CAG GAG GCC GAG CTG AGC 2924 
Leu Asp Ala Arg Arg Pro Trp Ala Gly Ser Gin Glu Ala Glu Leu Ser 
885 890 895 

CGG GAG GGA CCC TAC GGC CGC GAG TCG GAC CAC CAC GCC CGG GAG GGC 2972 
Arg Glu Gly Pro Tyr Gly Arg Glu Ser Asp His His Ala Arg Glu Gly 
900 905 910 

AGC CTG GAG CAA CCC GG6 TTC TGG GAG GGC GAG GCC GAG CGA GGC AAG 3020 
Ser Leu Glu Gin Pro Gly Phe Trp Glu Gly Glu Ala Glu Arg Gly Lys 
915 920 925 

GCC GGG GAC CCC CAC CGG AGG CAC GTG CAC CGG CAG GG6 GGC AGC AGG 3068 
Ala Gly Asp Pro His Arg Arg His Val His Arg Gin Gly Gly Ser Arg 
930 935 940 

GAG AGC CGC AGC GGG TCC CCG CGC ACG GGC GCG GAC GGG GAG CAT CGA 3116 
Glu Ser Arg Ser Gly Ser Pro Arg Thr Gly Ala Asp Gly Glu His Arg 
945 950 955 960 
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CGT CAT CGC OCXS CAC CGC AGO CCC GG6 GAG GAG GGT CCG GAG GAC AAG 3164 
Arg His Arg Ala His Arg Arg Pro Gly Glu Glu Gly Pro Glu Asp Lys 
965 970 975 

GCG QhQ CGG AGG 6CQ COG CAC CGC GAG GGC AGC CGG CCG GCC CGG GGC 3212 
Ala Glu Arg Arg Ala Arg His Arg Glu Gly Ser Arg Pro Ala Arg Gly 
980 985 990 

GGC GAG GGC GAG GGC GAG GGC CCC GAC GGG GGC GAG CGC AGG AGA AGG 3260 
Gly Glu Gly Glu Gly Glu Gly Pro Asp Gly Gly Glu Arg Arg Arg Arg 
99S 1000 1005 

CAC CGG CAT GGC GCT CCA GCC ACG TAC GAG GGG GAC GCG CGG AGG GAG 3308 
His Arg His Gly Ala Pro Ala Thr Tyr Glu Gly Asp Ala Arg Arg Glu 
1010 1015 1020 

GAC AAG GAG CGG AGG CAT CGG AGG AGG AAA GAG AAC CAG GGC TCC GGG 3356 
Asp Lys Glu Arg Arg His Arg Arg Arg Lys Glu Asn Gin Gly Ser Gly 
1025 1030 1035 1040 

GTC CCT GTG TCG GGC GCC AAC CTG TCA ACC ACC CGG CCA ATC CAG CAG 3404 
Val Pro Val Ser Gly Pro Asn Leu Ser Thr Thr Arg Pro He Gin Gin 
1045 1050 1055 

GAC CTG GGC CGC CAA GAC CCA CCC CTG GCA GAG GAT ATT GAC AAC ATG 3452 
Asp Leu Gly Arg Gin Asp Pro Pro Leu Ala Glu Asp He Asp Asn Met 
1060 1065 1070 

AAG AAC AAC AAG CTG GCC ACC GCG GAG TCG GCC GCT CCC CAC GGC AGC 3500 
Lys Asn Asn Lys Leu Ala Thr Ala Glu Ser Ala Ala Pro His Gly Ser 
1075 1080 1085 

CTT GGC CAC GCC GGC CTG CCC CAG AGC CCA GCC AAG ATG GGA AAC AGC 3548 
Leu Gly His Ala Gly Leu Pro Gin Ser Pro Ala Lys Met Gly Asn Ser 
1090 1095 1100 

ACC GAC CCC GGC CCC ATG CTG GCC ATC CCT GCC ATG GCC ACC AAC CCC 3596 
Thr Asp Pro Gly Pro Met Leu Ala He Pro Ala Met Ala Thr Asn Pro 
1105 1110 1115 1120 

CAG AAC GCC GCC AGC CGC CGG ACG CCC AAC AAC CCG GGG AAC CCA TCC 3644 
Gin Asn Ala Ala Ser Arg Arg Thr Pro Asn Asn Pro Gly Asn Pro Ser 
1125 1130 1135 

AAT CCC GGC CCC CCC AAG ACC CCC GAG AAT AGC CTT ATC GTC ACC AAC 3692 
Asn Pro Gly Pro Pro Lys Thr Pro Glu Asn Ser Leu He Val Thr Asn 
1140 1145 1150 

CCC AGC GGC ACC CAG ACC AAT TCA GCT AAG ACT GCC AGG AAA CCC GAC 3740 
Pro Ser Gly Thr Gin Thr Asn Ser Ala Lys Thr Ala Arg Lys Pro Asp 
1155 1160 1165 

CAC ACC ACA GTG GAC ATC CCC CCA GCC TGC CCA CCC CCC CTC AAC CAC 3788 
His Thr Thr Val Asp He Pro Pro Ala Cys Pro Pro Pro Leu Asn His 
1170 1175 1180 
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ACC GTC GTA CAA GTG AAC AAA AAC GCC AAC CCA GAC CCA CTG CCA AAA 3836 
Thr Val Val Gin Val Asn Lys Asn Ala Asn Pro Asp Pro Leu Pro Lys 
1185 1190 1195 1200 

AAA GAG GAA GAG AAG AAG GAG GAG GAG GAA GAC GAC CGT GGG GAA GAC 3884 
Lys Glu Glu Glu Lys Lys Glu Glu Glu Glu Asp Asp Arg Gly Glu Asp 
1205 1210 1215 

GGC OCT 7^ CCA ATG CCT CCC TAT AGC TCC ATG TTC ATC CTG TCC ACG 3932 
Gly Pro Lys Pro Met Pro Pro Tyr Ser Ser Met Phe He Leu Ser Thr 
1220 1225 1230 

ACC AAC CCC CTT CGC CGC CTG T6C CAT TAC ATC CTG AAC CTG CGC TAC 3980 
Thr Asn Pro Leu Arg Arg Leu Cys His Tyr He Leu Asn Leu Arg Tyr 
1235 1240 1245 

TTT GAG ATG TGC ATC CTC ATG GTC ATT GCC ATG AGC AGC ATC GCC CTG 4028 
Phe Glu Met Cys He Leu Met Val He Ala Met Ser Ser He Ala Leu 
1250 1255 1260 

GCC GCC GAG GAC CCT GTG CAG CCC AAC GCA CCT CGG AAC AAC GTG CTG 4076 
Ala Ala Glu Asp Pro Val Gin Pro Asn Ala Pro Arg Asn Asn Val Leu 
"65 1270 1275 1280 

CGA TAC TTT GAC TAC GTT TTT ACA GGC GTC TTC ACC TTT GAG ATG GTG 4124 
Arg Tyr Phe Asp Tyr Val Phe Thr Gly Val Phe Thr Phe Glu Met Val 
1285 1290 1295 

ATC AAG ATG ATT GAC CTG GGG CTC GTC CTG CAT CAG GGT GCC TAC TTC 4172 
He Lys Met He Asp Leu Gly Leu Val Xieu His Gin Gly Ala Tyr Phe 
1300 1305 1310 

CGT GAC CTC TGG AAT ATT CTC GAC TTC ATA GTG GTC AST GGG GCC CTG 4220 
Arg Asp Leu Trp Asn He Leu Asp Phe He Val Val Ser Gly Ala Leu 
1315 1320 1325 

GTA GCC TTT GCC TTC ACT GGC AAT AGC AAA GGA AAA GAC ATC AAC ACG 4268 
Val Ala Phe Ala Phe Thr Gly Asn Ser Lys Gly Lys Asp He Asn Thr 
1330 1335 1340 

ATT AAA TCC CTC CGA GTC CTC CGG GTG CTA CGA CCT CTT AAA ACC ATC 4316 
He Lys Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys Thr He 
"45 1350 1355 1360 

AAG CGG CTG CCA AAG CTC AAG GCT GTG TTT GAC TGT GTG GTG AAC TCA 4364 
Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val Asn Ser 
1365 1370 1375 

CTT AAA AAC GTC TTC AAC ATC CTC ATC GTC TAC ATG CTA TTC ATG TTC 4412 
Leu Lys Asn Val Phe Asn He Leu He Val Tyr Met Leu Phe Met Phe 
1380 1385 1390 

ATC TTC GCC GTG GTG GCT GTG CAG CTC TTC AAG GGG AAA TTC TTC CAC 4460 
He Phe Ala Val Val Ala Val Gin Leu Phe Lys Gly Lys Phe Phe His 
1395 1400 1405 
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TGC ACT GAC GAG TCC AAA GAG TTT GAG AAA GAT TGT CGA GGC AAA TAC 4S0B 
Cys Thr Asp Glu Ser Lys Glu Phe Glu hys Asp Cys Arg Gly Lys Tvr 
1410 1415 1420 ^ 

CTC CTC TAC GAG AAG AAT GAG GTG AAG GCG CGA GAC CGG GAG TGQ AA6 4556 
Leu Leu Tyr Glu Lys Asn Glu Val Lys Ala Arg Asp Arg Glu Trp Lys 
"25 1430 1435 1440 

AAG TAT GAA TTC CAT TAC GAC AAT GTG CTG TGG GCT CTG CTG ACC CTC 4604 
Lys Tyr Glu Phe His Tyr Asp Asn Val Leu Trp Ala Leu Leu Thr Leu 

1445 1450 1455 

TTC ACC GTG TCC ACG GGA GAA GGC TGG CCA CAG GTC CTC AAG CAT TCG 4652 
Phe Thr Val Ser Thr Gly Glu Gly Trp Pro Gin Val Leu Lys His Ser 
1460 1465 1470 

GTG GAC GCC ACC TTT GAG AAC CAG GGC CCC AGO CCC GGG TAC CGC AT6 4700 
Val Asp Ala Thr Phe Glu Asn Gin Gly Pro Ser Pro Gly Tyr Arg Met 
1475 1480 1485 

GAG ATG TCC ATT TTC TAC GTC GTC TAC TTT GTG GTG TTC CCC TTC TTC 4748 
Glu Met Ser He Phe Tyr Val Val Tyr Phe Val Val Phe Pro Phe Phe 
1490 1495 1500 

TTT GTC AAT ATC TTT GTG GCC TTG ATG ATC ATC ACC TTC CAG GAG CAA 4796 
Phe Val Asn He Phe Val Ala Leu He He He Thr Phe Gin Glu Gin 
1505 1510 1515 1520 

GGG GAC AAG ATG ATG GAG GAA TAC AGC CTG GAG AAA AAT GAG AGG GCC 4844 
Gly Asp Lys Met Met Glu Glu Tyr Ser Leu Glu Lys Asn Glu Arg Ala 
1525 1530 1535 

TGC ATT OAT TTC GCC ATC AGC GCC AAG CCG CTG ACC CGA CAC ATG CCG 4892 
Cys He Asp Phe Ala He Ser Ala Lys Pro Leu Thr Arg His Met Pro 
1540 1545 1550 

CAG AAC JIAG CAG AGC TTC CAG TAC CGC ATG TGG CAG TTC GTG GTG TCT 4940 
Gin Asn Lys Gin Ser Phe Gin Tyr Arg Met Trp Gin Phe Val Val Ser 
1555 1560 1565 

CCG CCT TTC GAG TAC ACG ATC ATG GCC ATG ATC GCC CTC AAC ACC ATC 4988 
Pro Pro Phe Glu Tyr Thr He Met Ala Met He Ala Leu Asn Thr He 
1570 1575 1580 

GTG CTT ATG ATG AAG TTC TAT GGG GCT TCT GTT GCT TAT GAA AAT GCC 5036 
Val Leu Met Met Lys Phe Tyr Gly Ala Ser Val Ala Tyr Glu Asn Ala 
1585 1590 1595 1600 

CTG CGG GTG TTC AAC ATC GTC TTC ACC TCC CTC TTC TCT CTG GAA TGT 5084 
Leu Arg Val Phe Asn He Val Phe Thr Ser Leu Phe Ser Leu Glu Cys 
1605 1610 1615 

GTG CTG AAA GTC ATG GCT TTT GGG ATT CTG AAT TAT TTC CGC GAT GCC 5132 
Val Leu Lys Val Met Ala Phe Gly He Leu Asn Tyr Phe Arg Asp Ala 
1620 1625 1630 
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TTO AAC ATC TTC SAC TTT GTG ACT GTT CTG GGC AGC ATC ACC GAT ATf 
Trp Asn lie Phe Asp Phe Val Thr Val Leu Oly Jle tS 2J JIJ 

1640 1645 

SS Si ?S S2 121 m ^ AAT AAC TTC ATC AAC CTG AGC TTT 

^ lie Asn Leu Ser Phe 

1655 1660 

CTC CGC CTC TTC CGA GCT GCC CGG CTC ATC AAA CTT CTC CGT GAG GGT 
Leu^Arg Leu Phe Arg Ala^Ala Arg Leu lie Lys S 2S ^ ^ 

^ iS iJe ^ T?I TGG ACC TTT GTG CAG TCC TTC AAG GCC 

Tyr Thr lie Arg lie Leu Leu Trp Thr Phe Val Gin Ser Phe Lys Ala 
1685 1690 1695 

2S ?S ?^ 7^ ATC GCC ATG CTC TTC TTC ATC TAT GCC 

Leu Pro Tyr Val Cys Leu Leu He Ala Net Leu Phe Phe lie Tyr Ala 
^^00 1705 1710 

T?f T^" ^ 5^ ^ *^ AAC ATT GGC ATC GAC QTO SAG Q»C 

lie He Gly Met Qln Val Phe Gly Asn He Gly lie ^ ^ ^ 
1^15 1720 1725 ^ 

GAG GAC ACT GAT GAA GAT GAG TTC CAA ATC ACT GAG CAC AAT AAC TTC 

^0 Vits^^^ ^ S!o *^ ^ 

^ II'^ GCC CTC ATG CTT CTC TTC CGG AGT GCC ACC GGG 

Arg Thr Phe Phe Gin Ala Leu Met Leu Leu Phe Arg i^r Ma ?S S 

" 1*^50 1755 1760 

GAA GCT TGG CAC AAC ATC ATG CTT TCC TGC CTC AGC GGG AAA CCG TOT 
Glu Ala Trp His Asn He Met Leu Ser Cys Leu Ser lly ^ So ^ 
1765 1770 1775 

WVG AAC TO GGC ATC CTG ACT CGA GAG TCT GGC AAT GAA TTT GCT 
Asp Lys Asn Ser Gly He Leu Thr Arg Glu Cys Oly Asn Glu ^ Ala 
1780 1785 173Q 

TAT m TAC TTT GTT TCC TTC ATC TTC CTC TGC TOG TTT CTQ ATG CTG 
Tyr Phe Tyr Phe Val Ser Phe He Phe Leu Cys Ser Phe Leu Met 2u 
1795 1800 1805 

AAT CTC TTT GTC GCC GTC ATC ATG GAC AAC TTT GAG TAC CTC ACC CGA 
Asn l^u Phe Val Ala Val He Met Asp Asn Phe Glu Tyr Leu Tte S 
1810 1815 1820 

GAC TCC TCC ATC CTG GGC CCC CAC CAC CTG GAT GAG TAC QTG CGT GTC 
Asp Ser Ser He Leu Gly Pro His Bis Leu Asp Glu Tyr Val Arg Val 

1830 1835 1840 

^ I^^ GCA GCT TGG GGC CGC ATG CCT TAC CTG GAC 

Trp Ala Glu Tyr Asp Pro Ala Ala Trp Gly Arg Met Pro Tyr Leu Asp 
1845 1850 1855 



5180 



5228 



5276 



5324 



5372 



5420 



5468 



5516 



5564 



5612 



5660 



5708 



5756 



5804 
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CTG ATC 

Leu He 
1905 

AAA CAG 
Lys Gin 



CCC AAT 
Pro Asn 



^ ^ s; M jj? - - - «= 

1880 1885 

^ f"^ ^^'^ TTC AAT TCC ACC Cic ATG OCT 

Sso ^ "^"^ T^.''*^ His Phe Asn Ser Thr S Si? 

1895 1900 

^ 5^ t"^ GCC AAG GGA GGA GCC GAC 

Arg Thr Ala Leu^Asp He Lys lie Jla^Lys Gly Gly Ala ^ 

CAG ATC GAC GOT GAG CTG CGG AAG GAG ATG ATG GCG ATT TGG 
Gin Met Asp Ala Glu Leu Arg Lys Glu Met Met Ala lie S 
"25 1930 3^335 

CTGTCCCMAAGA06CTASACCT0CTGQTCACACCTCACAAG 
Leu Ser Gin Lys Thr Leu Asp Leu Leu Val ?S 2s ^ 

IMS 1950 

IS S £2S ?S SS S S 5? Si JS? i2? S S,-? 

I960 1965 

^ If^ ^ °^ AAO GCC AAG AAG CTG CAG GCC ATG CGC CSac 

1975 1980 

GAG CAG GAC CGG ACA CCC CTC ATG TTC CAG CGC ATG GAG CCC CCG TCC 
Glu Gin ASP Arg Thr Pro Leu Met Phe Gin Arg 2fS S £o s2 

1995 2000 

CCA ACQ CAG GAA GGG GGA CCT GQC CAG AAC GCC CTT cep Trn anf 
Pro Thr Gin Glu Gly Gly Pro Gly ^ ?ro llr ?S 

2005 2010 2015 

CTG GAC CCA GGA GGA GCC CTG ATG OCT cm: GAA AGC GGC CTC AAG Gar- 
Leu Asp Pro Gly Gly Ala Leu Met Ala His IJS ^ eij 2S S2 
202O 2025 2030 

AGC CpG TCC TGG GTQ ACC CAG CGT GCC CAG GAG ATC TTC CAG AAG arr 
^° Val Thr Gin Axg Ala Gin Glu Se 

2035 2040 2045 

^ iS?^ ®^ <3GC CCC CCT ACC GAC ATG CCC AAC AGC 

Gly Thr Trp Ser Pro Glu Gin Gly Pro Pro Thr Asp Met Pro JJn Jer 
^^^^ 2055 2060 

^ CI^ 'TCC GTG GAG ATG CGA GAG ATG GGC AGA GAT GGC 

Gln^Fro Asn ser Gin Ser^Val Glu Met Arg Glu^Met Sy ^ Sy 



5852 



5900 



5948 



5996 



6044 



6092 



6140 



6188 



6236 



6284 



6332 



6380 



6428 



6476 



200 



TAG TCC GAC AGC GAG CAC TAG CTC CCC ATG GAA GGC GAG GGC CGG GCT 
Tyr ser Asp Ser Glu His Tyr Leu Pro Met Qlu Gly Gin Gly iS Ala 
2085 2090 2095 

GCC TCC ATG CCC CGC CTG GCT GCA GAG AAC GAG AGG AGA AGG GGC CGG 
Ala Ser Met Pro Arg Leu Pro Ala Qlu Asn Gin Arg Arg Arg Gly Arg 
2100 2105 2110 

^ ^ ^ ^ TCA GAC ACC AGC CCC ATG AAG 

Pro Arg Gly Asn Asn Leu Ser Thr lie Ser Asp Thr Ser Pro Met Lys 
2115 2120 2125 

CGT TCA GCC TCC GTG CTG GGC CGG AAG GCC CGA CGC CTG GAC GAT TAG 
Arg ser Ala Ser Val Leu Gly Pro Lys Ala Arg Arg Leu Asp A^ Sr 
2130 2135 2140 ^ ^ y 

If ^ fl!! ^ ^ S^? GAG GAG AAC CAG CGG CAC CAC CAG CGG 

ser Leu Glu Arg Val Pro Pro Glu Glu Asn Gin Arg His His Gin Arg 

2150 2155 2160 

CGC CGC GAG CGG AGC GAC CGG GCC TCT GAG CGC TCC CTG GGC CGC TAG 
Arg Arg Asp Arg Ser His Arg Ala Ser Glu Arg Ser Leu Gly Arg Tyr 
2165 2170 2175 

^^^^f^^<^'^(^J<CAGACCTG.l^^ ATG ACC ACC CAA 
Thr Asp Val Asp Thr Gly Leu Gly Thr Asp Leu Ser Met Thr Thr Gin 
2180 2185 2190 

TCC GOG GAG CTG CGG TC6 AAG GAG CGG GAC CAG GAG CGG GGC CGG GCC 
ser Gly Asp Leu Pro Ser Lys Glu Arg Asp Gin Glu Arg Gly Arg Pro 
2195 2200 2205 

AAG GAT OGG AAG CAT CGA CAG CAG CAC CAC CAC CAC CAC CAC GAC CAC 

His His His His His His His His His 
2210 2215 2220 

CAT CCC CGG CCC GCC GAC AAG GAC CGC TAT GCC CAG GAA CGG GCG GAC 

Asp Arg Tyr Ala Gin Glu Arg Pro Asp 
2225 2230 2235 2240 

CAC GGC CGG GCA CGG GCT CGG GAC GAG CGC TGG TCC CGC TCG CCC AGC 
Hia Gly Arg Ala Arg Ala Arg Asp Gin Arg Trp Ser Arg Ser Pro Ser 
2245 2250 2255 

GAG GGC CGA GAG CAC ATG GCG CAC CGG CAG TAGTTCCGTA AGTGGAAGCC 
Glu Gly Arg Glu His Met Ala His Arg Gin 
2260 2265 

CAGCCCGCTC AACATCTGGT ACCAGCACTC CGCGGCGGGG CCGCCGCCAG CTCCCCCAGA 

CCCCCTCCAC CCCCCGGCCA CACGTGTCCT ATTCCCCTGT GATCCGTAAG GCCGGCGGCT 

CGGGGCCCCC GCAGCAGCAG CAGCAGCAGG AGGCGGTGGC CAGGCCGGGC GGGGCGGCGA 

GGAGCGGCGC TCGGAGGTAC CCAGGCCCCA CGGCCGAGCC TCTGGCCGGA GATCGGCCGC 



6524 

6572 

6620 

6668 

6716 

6764 

6812 

6860 

6906 

6956 

7004 

7054 

7114 
7174 
7234 
7294 



-201- 



CCAGGCCCGG 
GCATCTGGCC 
GGCTCCGACT 
GGGGCCTACX3 
CCCAGGACTC 
CTCCCCAACG 
AAGGGCCTGC 
TGGCGCCCGC 



CCACGGGGGG CGACAGCAGC GGCC6CTCGC CCAGGATGGA GAGGCGGGTC 
CCCGGAGCGA GTCCCCCAGG GCCTGTCGAC ACGGCGGGGC CCGGTCGCCG 
CGCACGTGTC CGAGGGGCCC CCGGGTCCCC 6GCACCATGG CTACTACCGG 
ACOACXaUSGC CGATGGCCCG GGCAGCGGGG GCGGCGAGGA GGCCATCGCC 
ACGCGCCACC CCCCGTACGA CACGCGTCCT CGGGCGCCAC CGGGCGCTCG 
CCCGGGCCTC GGGCCCGGCC TGCGCCTCGC CTTCTCGGCA CGGCCGGCGA 
GCTACTACCC GGCGCACGGA CTGGCCAGGC CCCGCGGGCC GGGCTCCAGG 
ACGAACCCTA CAGCGAGAGT GACGATGATT GGTGCTAAGC CCGGGCGAGG 
CCGGCCCCCC ACGCACC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 7032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) ' 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCAT ION; 166.. 6921 

(D) OTHER INFORMATION: /atandard_naine= -Alpha-1E-1« 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCTGCTGCTG CCTCTCCGAA GAGCTCGCGG AGCTCCCOVG AGGCGGTGGT CCCCGTOCTT 
CTCTGGATGC GGCTCTGAGT CTCCGTGTGT CTTTCTGCTT GTTGClt3TGT GCGGGTGTTC 
GGCCGCGATC ACCTTTGTCT GTCTTCTGTC TCTTTAAACC TCAGG ATG GCT CGC 

Met Ala Arg 

^ ^ S S2 55 S ^ - - - 0^5 ^ S 

10 15 

^« ^« AGC AGG AAC CGG CAA GGA ACC CCC GTG CCG GCC TCG GGG CAG 
ABP Gin Ser Arg Asn Arg Gin Gly Thr Pro Val Pro Ali ?S ^ S2 

25 30 35 

f ?f TAG AAG CAG ACQ AAA GCA CAG AGG GCG CGG ACT ATG GCT 

Ala Ala Ala Tyr Lys Gin Thr Lys Ala Gin Arg Ala Arg SJ? SI 

45 50 



7354 

7414 

7474 

7534 

7594 

7654 

7714 

7774 

7791 



60 
120 
174 

222 

270 

318 



-202- 



S2 ^ S^? """^ "^-^ TTC ACC GTC AAC AGA 

Leu Tyr Asn Pro lie Pro Val Arg Gin Asn Cys Phe Thr Val Asn ^ 
S5 60 65 ^ 

T?^ S?* ^^"^ ATT GTC AGG AAA TAT GCC AAG 

Ser Leu Phe lie Phe Gly Glu Asp Asn lie Val Arg Lys Tyr Ala Lys 

75 80 

S» r"^ I!? CCA TTT GAG TAC ATG ATC CTG GCC ACC ATC 

Lys Leu lie Asp Trp Pro Pro Phe Glu Tyr Met lie Leu Ala Thr lie 
"5 90 95 

SI ^"^^ CTG GCC CTG GAG CA6 CAT CTT CCT GAG GAT 

lie Ala Asn Cys He Val Leu Ala Leu Glu Gin His Leu Pro Glu ^p 

no 115 

if ^ ^'^^ <^ CTG GAG AAG ACA QAA CCT TAT TTC 

Asp Lys Thr Pro Met Ser Arg Arg Leu Glu Lys Thr ^ pS ^ S 
120 125 130 

S? lie S^I ST" ^ *^ ATT GTG GCC CTG GOG 

lae Gly lie Phe cys Phe Glu Ala Gly He Lys lie Val Ala Leu Gly 
"5 140 3^45 ' 

t^f ^ <3GC TCT TAC CTC CGC AAT GGC TCG AAT GTC ATG 

Phe He Phe His Lys Gly Ser Tyr Leu Arg Asn Gly Trp Asn vS Met 

155 160 

GAC TTC ATC GTG GTC CTC AGT GGC ATC CTG GCC ACT GCA GGA ACC CAC 
Asp Phe He val Val Leu Ser Gly He Leu Ala Thr ^ ^ gs 

-^^^ 170 175 

Se JJI ^ ^ AGG ACC CTC CGG GCT GTG CGT GTC CTG 

i^ne Asn Thr His Val Asp Leu Arg Thr Leu Arg Ala Val Arg Val Leu 

^ ^ ?ZP r"*^ ATA CCT AGC CTG CAG ATT GTG TTG 

Arg Pro Leu Lys Leu Val Ser Gly He Pro Ser Leu Gin He Val Leu 
200 205 210 

AAG TCC ATC ATG AAG GCC ATG 6TA CCT CTT CTG CAG ATT GGC CTT CTS 
Lys ser He Met Lys Ala Net Val Pro Leu Su S §S S 2S 
215 220 225 

S!; ^ ATC CTC ATG m 6CT ATC ACT QGT TTC GAG TTC TAC 

Leu 
240 

«?f, ^ TE!t S^T ??® TF^ ATC AAC AAT TCA GGT ATT CTA 

Asn 
255 

^ ^ S?! CCA TGT GGT GTG CAG GGC TGC CCA GCT 

Val Gin Gly Cys Pro Ala 
270 275 



fAii iCI XT . ^'-'^ "^^^ ^ A^ A^ GGT TTG GAG TTC TAC 

Leu Phe Phe Ala lie Leu Met Phe Ala He He Gly Leu Glu Phe T^ 
230 235 ^ 

"aa; niZ ^ ^ A™ AAC AAT TCA GGT ATT CTA 
ser Gly Lys Leu Has Arg Ala Cys Phe Met Asn Asn Ser Gly He Leu 
245 250 

™ GGA TTT GAC CCC CCT CAC CCA TGT GGT GTG CAG GGC TGC CCA GCT 
Glu Gly Phe Asp Pro Pro His Pro Cys Gly Val Gin Gly Cys Pro Ala 

265 — . 



366 



414 



462 



510 



558 



606 



654 



702 



750 



798 



846 



894 



942 



990 



•203- 



290 

305 

320 

350 355 
365 370 

380 

430 435 
" 445 450 

460 4g5 

4eo 

490 495 



1038 



1086 



1134 



1182 



1230 



1278 



1326 



1374 



1422 



1470 



-204- 



1758 



510 53^5 

22 ?s ?s s S SS S S ^ J.-? ^ ™ - 

525 
540 

?S ^ ^ Si v=S S ilS S iSS s ^ 

555 5go 

5S 5S ^ JS Si ^ °^ ^ ATA 

565 ^ ^ Arg Ala Leu Arg Leu Leu Arg He 

570 

580 " i.yB Tyr Trp Ala Ser Leu Arg Asn Leu Val Val Ser 

585 

?S JI^ S 2S S iS? S S5 



"0 625 

o35 



640 



?£ s s is; s s SI s SI s X s s 

700 705 

s ^ Si s ^ s; Si i?j s ^ 



1854 



1902 



1950 



1998 



2046 



2094 



650 655 *^ 

665 670 675 

685 690 



2334 



720 



-205- 



730 

|S S.-? - 1^ SI? J" SS S M Si 

750 

765 

780 785 

S ^ S «^ ^ 2S 52 «C «C CTC «T OCC 

790 r?" Asn Pro Leu Asn Ala 

800 

2J ^ ^ ^ CCC jac GCC ATT a«0 GGC CTO GCC CTG 

805 ^ <3ly I«u Ala Leu 

815 

835 

850 

880 

890 895 

910 33^5 



2430 



2478 



2526 



2574 



2622 



2670 



2718 



2766 



2814 



2862 



-206- 



990 



995 

GAG CCA €3AA G6C AGC 

^ i5«='" 

S S 22 ^ S2 - ^ S ?S 



1055 



"8° 1085 1090 

vJ? ^ ^ ^ ^ SAG AAG CCT GAQ ACA GGC AAA GCr 

Val Glu Lya Lys Gin Lys i,ys Glu Lya Arg S2 S ^ 

^ 1100 1105 

is? ?S ^ ^ JJ? £S S ^ JI^ 

S M ^ s ^ ?s i2S 2S ^ ^ 

1130 1135 

11^ JT^ if? ^ AGC AGC ATC GCC CTC GC6 6CA GAG 6AC 

Ile^Leu Leu Val He Ala Ala Ser Ser He Ala I*u Ala IS Su ?^ 
11*5 1150 j^j^gg 

^« ^ t"^ AAC AAA 6TC CTG AG6 TAT TTT GAC 

Pro Val Leu Thr Asn Ser Glu Arg Asn Lys Val Leu Arg T^ Sp 
1160 1165 1170 



3198 



3246 



3294 



1015 1020 1025 

1035 1040 



J-OeS X070 1075 



3486 



3582 



3630 



3678 



-207- 



1180 1185 
J-^^S 1200 

1210 1215 

2S f r° S?* AAC AAA 6GA CGG GAC 

S20 * ^ I-y^ Gly Arg Asp 

1225 1230 



^ys sier i«u Arg Val Leu Arg Val Leu Arg Pro Leu 

CGC TTG CCC AAG CTC AAG GCC GTC TTC GAC Tt3C rsra 
Arg Leu Pro Lye Leu Lys Ala ^ ^ ^ SJ? 

^-^^5 1260 

AA6 AAT GTC TTC AAC ATA CTC ATT OTG TAG AAG CTC 

1270 1275 



ATC AAG ACC ATC 
lie Lys Thr He 
1235 

AAA ACC ATC AAG 
Lys Thr He Lys 
1250 

GTG ACC TCC TTG 
Val Thr Ser Leu 
1265 

TTC ATG TTC ATC 
Phe Met Phe He 
1260 



1290 1295 
^305 1310 1315 



AC6 GAC 
Thr Asp 
1300 

GAT CAC 
Asp His 



CAT 6AA 
His Glu 



QAGAAAAACAAGATGGAGGTGAA6GGCCGGGAATGGAAGCGC 
Glu Lys Asn^Lys Met Glu Val Ly^^Gly Arg Glu ?S S 

S?S£?S.^----gA^^CTG«^^^^^ 

S £2S ^ S f s s.-? 

"85 1390 1395 



3726 



3774 



3822 



3870 



3918 



3966 



4014 



4062 



4110 



4158 



4206 



4254 



4302 



4350 



-208- 



GTC AAT ATC TTT GTG GCT 
Val Asn lie Phe Val Ala 
1400 

GAT AAG ATG ATG GAG GAG 
Asp Lys Met Met Glu Glu 
1415 

ATC GAC TTC GCC ATC AGC 
He Asp Phe Ala He Ser 
1430 

AAC AGA CAC ACC TTC CAG 
Asn Arg His Thr Phe Gin 
1445 



CTC ATC ATC ATC ACC TTC CAG GAG CAA GGG 
Leu He He He Thr Phe Gin Glu Gin Glv 
1405 1410 ^ 

TGC AGC CTG GAG AAG AAT GAG AGG GCG TGC 
Cys fier Leu Glu Lys Asn Glu Arg Ala Sb 
1420 1425 

GCC AAA CCT CTC ACC CGC TAG ATG CC6 CAG 
Ala Lys Pro Leu mr Arg Tyr Met Pro ^ 
-^^^^ 1440 

TAC CGC GTG TGG CAC TTT GTG GTG TCT CC6 
Tyr Arg val Trp His Phe Val Val IS Pro 
1450 1455 



TCC TTT GAG TAC ACC ATT ATG 

f^L^*"® ^^"^ ^ lie Met 

1460 i4g5 

CTG ATG ATG AAG TAT TAT TCT 
Leu Met Met Lys Tyr Tyr Ser 
1480 

AAG TAC CTG AAT ATC GCC TTC 
Lys Tyr Leu Asn He Ala Phe 
1495 

CTG AAG GTC ATC GCT TTT GGC 
Leu Lys Val He Ala Phe Gly 
1510 



GCC ATG ATC GCC TTG AAT ACT GTT GTG 
Axa Met He Ala Leu Asn Thr Val Val 
1470 1475 

GCT CCC TGT ACC TAT GAG CTG GCC CTG 
Ala Pro Cys Thr Tyr Glu Leu Ala Leu 
1485 1450 

ACC ATG GTG TTT TCC CTG QAA TGT GTC 
Thr Met Val Phe Ser Leu Glu Cys Val 
1500 1505 

TTT TTG AAC TAT TTC CGA GAC ACC TGG 
Phe Leu Asn Tyr Phe Arg Asp Thr Trp 
^^15 1520 



1530 1535 

CTC ACA GAC AGC AAG CTG GTG AAC ACC AGT GGC TTC 
Leu^Thr Asp Ser Lys Leu^Val Asn Thr Ser Gly Se 



CTG AAG CTC TTC CGA GCT GCC CGC CTC ATA AAG CTT 
Leu Lys Leu Phe Arg Ala Ala Arg S ?2 ^ 
1560 1565 

?S Jhr T?^ 7^ ACC TTT GTG CAG 

Tyr Thr He Arg He Leu Leu Trp Thr Phe Val Gin 
1575 1580 



AAT ATG AGC TTT 
Asn Met Ser Phe 
1555 

CTG CGT CAG GGC 
Leu Arg Gin Gly 
1570 

TCC TTT AAG GCC 
Ser Phe Lys Ala 
1585 



s s s S2 ?s M iss ^s ^ s; z^. x 

1610 1615 



4398 



4446 



4494 



4542 



4590 



4638 



4686 



4734 



4782 



4830 



4878 



4926 



4974 



5022 



1630 3^g3j, 
^^^^ 1650 

^ ^ S ^ ^ J£ ^ ™ S= »C 0,« T« 

1670 ^ ^ ™^ Tyr Val Tyr 

1680 

1710 3^73^5 
1725 173Q 
1740 1745 

1760 

IS ^ S? S; ^ ^ is S 2S jSS i1 r °- 

1765 ^ V^^A ™^ Val Ala 

1770 1775 



1790 

1810 

1820 1325 



5118 

5166 

5214 

5262 

5310 

5358 



5454 

5502 



5598 



5646 



5694 



-210- 



2S S S? S ^ Si S ^ ?P ^« 

1845 ilcn ^ Met Asp Tyr Tyr 

1850 1855 



AAG CAG ACT AAG GTG AAO AAG 
Lys Gin Ser Lys Val Lys Lys 

AAT GCC CCC ATG TTC CAG CGC 
Asn Ala Pro Met Phe Gin Arg 
1880 

ATC ATT GCT AAT GCC AAA GCC 
He He Ala Asn Ala Lys Ala 
1895 

TCA GQC CTG AGT GGC CGG AGT 
Ser Gly Leu Ser Gly Arg Ser 
1910 



^« S?^® ^ GAG GAA CAG AAA 

Gin Arg Qln Gin Leu Glu Glu Gin Lvb 
1870 1875 

ATG GAG CCT TCA TCT CTG CCT CAG GAG 
Met Glu Pro Ser Ser Leu Pro Gin Glu 
1885 1390 

CTG CCT TAC CTC CAG CAG GAC CCC GTT 
Leu Pro Tyr Leu Gin Gin Asp Pro Val 
1900 1905 

GGA TAC CCT TCG ATG AGT CCA CTC TCT 

Ser 

1920 



1930 1935 

ai a; s IS Su ^ ^ ATS 

1940 ^ 1945 ^ ^ Ser Ser Met 

1950 1955 

1965 1970 

SS SS - - S « gJ JL^ ?S IS 

^^^5 2000 

^UIO 2015 

^ ^ ^ 2P ^ AM aT CTT CTC TCT CCT OAT GTC 

Glu^Arg Arg Arg Ser Glu Arg Lys His Leu Leu S ^ ^ ^ 

^025 2030 2035 

I^r ^ ^ «^ ACC CAG GCT OAC TGG GAG TCC 

ser Arg cys Asn Ser Glu Glu Arg Gly Thr Gin Ala Asp ^ Ser 

2045 2050 
^"^^ 2060 2065 



5742 



5790 



5838 



5886 



5934 



5982 



6030 



6078 



6126 



6174 



6222 



6270 



6318 



6366 



■■■■-fj if-*d^ 



■211- 



'^"'^ 2080 

^^^^ 2115 

2f 2? S5 s s SI s r r «=<^ 

2130 

2S S S[ 2S SS ?^ ^ 

2135 " f f5« " Asn Ala Trp Leu Thr Glu 

2160 

2150 2195 

?s s ^ js; s s s= ™ - 

2200 MAC " ^ Arg 

2210 

His ?S SJ? ii^S 2g ^ S° f- COC CC3 

2215 ^ Arg Arg Arg Arg Arg Gly 

^^^o 2225 

2240 

^ S; IJJ JJ? ''^^'^'^^ TCCCCCCTCC OATGCAT^CT 

^^^^ 2250 



cc.«««. 

GAAGAGGGAA AAGGAAGATG GAAG 

(2) INFORMATION PGR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 7089 base pairs 



6414 

6462 

6510 

6558 

6606 

6654 

6702 

6750 

6798 

6846 

6894 

6948 

7008 
7032 
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(B) lYPE: nucleic acid 

(C) STRANDSnNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECOIiE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 166.-6978 

(D) OTHER INFORMATION: /standard^name« "Alpha-lE-3" 
(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 
GCTGCTGCTG CCTCTCCGAA GAGCTCGCGG AGCTCCCCAG AGGCGGTGGT CCCCGTGCTT 
GTCTGGATGC GGCTCTGAGT CTCCGTGTGT CITTCTGCTT GTTGCO^ GOGGGTOTl^ 
GGCCGCGATC ACCTTTGTCT GTCTTCTGTC TGTTTAAACC TCAGG ATC GCT CGC 

Met Ala Arg 

10 15 

25 30 ^ 35 

S2 ^ ^ I?^ ^ ^ OCT 

i-yr i,ys Gin Thr Lys Ala Gin Arg Ala Arg Thr Met Ala 

75 eo 



nf """^ ATC CTG GCC ACC ATC 

Lys Leu lie Asp Trp Pro Pro Phe Glu Tyr Met lie Leu Ala ^ He 



95 



105 2,10 ^ 



115 

P i.ys Tlir Pro Met Ser Arg Arg Leu Glu Lys Thr Glu Pro Tyr Phe 

125 

ATT GGG ATC TTT TGC TTT GAA GCT GGG ATC AAA AIT GTG GCC CTG GGG 



60 
120 
174 

222 

270 

318 

366 

414 

462 



510 



558 



606 



■213- 



654 



702 



n. Oly II. ^ cy. „„ 81. «. „j II. II. val «. „y 

140 145 

^ ^ s,^ s s r <^ ™ 

aso ^ ^ Tyr Leu Arg Asn Gly Trp Asn Val Met 

195 

210 

220 225 

2S SI S 2S Si? ^ S ^ ^ TAc 

230 " lie «ly Leu Glu Phe Tyr 

^ ^ 2J S S2 Se' if ^ ^ 

245 ^ Vfn ''St Asn Asn Ser Gly He Leu 

255 

^"^0 275 
285 290 
305 

315 320 ^ 

330 

355 

AAA GAG AGA GAG AQA GTG GAG AAC CGA AGG GCT TTC ATC AAG CTG r«n 



846 



894 



942 



990 



1134 



1162 
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Lys Glu Arg Glu teg Val Glu Asn Arg Axg Ala Phe Met Lys Leu Arg 

^ ^.xa oin eon lie Glu Arg Glu Leu Asn Gly Tyr Arg Ala Trp He 

380 

410 

?S mI? ^ AGT GAT GAG CAC TCT GTT GAT ATC 

Glu Ala Met Thr teg Asp Ser Ser Asp Glu His Cys Vai ^ JI; 

"5 430 *^ 435 

445 450 
^ 455 *^ His Lys Glu Arg Leu Leu Arg He 



560 



If3 ST ^ ^^'^ CGA GCC CTC OGG CTT CTA AGA ATA 

ser Phe Gly He Ser Val Leu Arg Ala Leu teg Leu ^ S 

570 575 

TTT AAA ATA ACC AAG TAT TGG GCT TCC CTA CGG AAT TTG GTG GTC TCC 



1326 



1374 



1422 



1470 



1516 



1566 



490 495 

505 510 515 

CTt; GGA CTC TTC CTC TTG GAG ATC TCC CTC AAG ATG TAT GQC ATO ear 
Leu Gly Leu Phe gu Leu Glu Met Ser Leu Ly, ?^ ^ jS? 

CCT CGC CTT TAT TTT CAC TCT TCA TTC AAC TGC TTT GAT TTT (3Gr nn-n 
Pro Arg Leu Tyr Phe His Ser Ser Phe SI ^ S ^ ?S 

540 545 

?s ?s |S s s s s is £j 



1758 



1806 



1854 



1902 



1 qqo 
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Phe Lys lie Tto Lys ^ Trp Ala Ser Leu Arg Asn i*u Val Val Ser 

590 



610 



SI ?f f T?'^ '^TG ACT QTG TTC CAG ATC CTC MG GGT GAG QAC TCG 
Pro Ala Ala He Met Thr Val Phe Gin He Le« Thr ^ ^ ^ 

650 



670 

»^ S ^S 3?? ^ S 2; ST ?S iSS 

685 690 

?S 2i S iSJ ?2 S S S SI iSJ 

700 705 



■'IS 720 



AAC CAG iU^ C»T GCA CTG CA(S AAG GCC AAG GAG GTC AGC COS ATG TCT 
Asn Gin Lys His Ala Leu Gin Lys Ala Lys Glu pS ij? ^ 

730 735 

GCA CCC AAC ATG CCT TOG ATC 6AA AGA GAC AGA AGG AQA AGA CAC CAC 
Ala Pro Asn Met Pro Sjr He Glu Arg Asp Arg Arg ^ 2s £s 

750 755 

S« S2 ^ S?" ^''^ SAG CGG AGG CGC 

Met ser Met Trp Glu Pro Arg Ser Ser His Leu Arg Glu Arg Arg Arg 
'60 765 

2f 5^ I^*^ ^ ACC AGC CAG CTG AGG AAG 

Arg His Hxa Met Ser Val Trp Glu Gin Arg Thr Ser Gin Leu ^ ^ 

2s ^ mI^ ®^ '''^ CTC AAC AGA GAG GAG GCG CCG 

His Met Gin Met Ser Ser Gin Glu Ala Leu Asn Arg Glu Glu aS pS 

795 eoo 

ACC ATG AAC CCG CTC AAC CCC CTC AAC CCG CTC AGC TCC CTC AAC CCa 



2046 



2094 



2142 



2190 



2238 



2286 



2382 



2430 



2478 



2526 



2574 



-216- 



Thr Met Asn Pro Leu Asn Pro Leu Asn Pro Leu Ser Ser Leu Asa Pro 

BIO 815 



CTC AAT GCC CAC CCC 
Leu Asn Ala His Pro 
820 

CTG GCC CTG GGC CTG 
I-eu Ala l«eu Gly Leu 
840 

CGT GGG GGG TCC CTC 
Arg Gly Gly Ser Leu 
655 

QAC AAC CAG AGG ACC 
Asp Asn Gin Arg Thr 
670 



A6C CTT TAT CGG CGA CCC AGG GCC ATT GAG GGC 
Ser Leu Tyr Arg Arg Pro Arg Ala He Glu ^ 

635 



GCC CTG GAG AAG TTC GAG GAG GAG CGC ATC AGC 
Ala Leu Glu Lys Phe Glu Glu Glu Arg lie s£ 

AAG GGG GAT GGA GGG GAC CGA TCC AGT GCC CTG 
Lys Gly Asp Gly Gly Asp Arg Ser S^ Sa 2S 
860 

CCT TTG TCC CTG GGC CAG CGG GAG CCA CCA TOG 
Pro Leu ser Leu Gly Gin Arg S ?S 

880 

BBS ^ fti^ ^® Asp Pro Thr Gin Gin Glu Ala 

850 895 



GGG GGA GGA GAG GCT 
Gly Gly Gly Glu Ala 
900 

CAG AGC CAA CGG CGC 
Gin Ser Gin Arg Arg 
920 

GAG TCC TCT TCA GCC 
Glu Ser Ser Ser Ala 
935 

CTG GAT GAA GCC ATG 
Leu Asp Glu Ala Met 
950 



vJ? ^ GAG GAC CGG GCC AGO CAC AGG 

Val Val Thr Phe Glu Asp Arg Ala Arg His Arg 

910 



r S^"^ ^^"^ GTC AGG ACA GAA GGC AAG 

ser Arg Hxs Arg Arg Val Arg Thr Glu Gly ^s 
^25 

TCC CGG AGC AGG TCT GCC AGC CAG GAA CGC AGT 
ser Arg Ser Arg Ser Ala Ser Gin Glu ^g sSJ 
940 

CCC ACT GAA GGG GAG AAG GAC CAT GAG CTC AGG 
Pro Thr Glu Gly Glu Lys Asp His Glu Leu ^ 



970 975 



GAT TTA AGG AGG ACC AAC 
Asp Leu Arg Arg Thr Asn 
980 985 

GCA GGA GGC CTT GAT GAG 
Ala Gly Gly Leu Asp Glu 
1000 

GAG CTG GAA CTG GGG AAG 
Glu Leu Glu Val Gly Lys 
1015 



ACT CTG ATG GTG TCC AGA GGC TCC GGG CTG 
Ser Leu Met Val Ser Arg Gly Ser Gly Leu 
990 995 

GCT GAC ACC CCC CTA GTC CTG CCC CAT CCT 
Ala Asp Thr Pro Leu Val Leu Pro His Pro 
1005 1010 

CAC GTG GTG CTG ACG GAG CAG GAG CCA GAA 
His Val Val Leu Thr Glu Gin Glu Pro Glu 
1020 1025 



2670 



2718 



2766 



2614 



2862 



2910 



2958 



3006 



3054 



3102 



3150 



3198 



3246 



GGC AGC AGT GAG CAG GCC CTG CTG GGG AAT GTG CAG CTA GAC ATG oar 
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Gly ser Ser Glu Gin Ma Leu Jeu oiy Asn Val cm Leu Asp Met Gly 

1035 1Q4Q 

1O50 1055 

1070 1075 



"80 1085 1090 ^ 
1100 

1110 ^ ^ 5*??=°^° I-ys Glu Thr 

1115 1120 



1135 



3342 



3390 



3438 



3486 



3534 



3582 



3678 



3726 



1150 ' 1155 

^^^5 1180 11B5 

1195 1200 
mI? tT"" GGC TTG ATC CTQ CAQ gat C3GG TCC TAG TTC CGA 

1210 1215 

^ TTG TTO AAC ATC CTG GAC TTT 6TC GTG GTC GTT GGC OCA rra GTG 
jBp Leu Trp Asn lie Leu Asp Phe Val Val Val Val ^ S ?S 

1225 1230 1235 

Iff P° ^ ^ ^ «3A ACC AAC AAA GGA CGG GAC ATC 

Ala Phe Ala Leu Ala Asn Ala Leu Gly Thr Asn Lys Gly S J^P ill 

1245 1250 
AAG ACC ATC AA6 TCT CTG CGG GTG CTC CGA GTT CTA AGG cra rrr, *7.» 



3822 



3870 



3918 
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Lys Thr He hya Ser Leu Arg Val Leu Arg Val Leu Arg Pro Leu Lys 
1255 1260 1265 

ACC ATC AAG CGC TTG CCC AAG CTC AAG GCC GTC TTC GAC TGC GTA GTG 
Tlir He Lys Arg Leu Pro Lys Leu Lys Ala Val Phe Asp Cys Val Val 
1270 1275 1280 

ACC TCC TTG AAG AAT GTC TTC AAC ATA CTC ATT GTG TAC AAG CTC TTC 
Thr Ser Leu Lys Asn Val Phe Asn He Leu He Val Tyr Lys Leu Phe 
1285 1290 1295 

ATG TTC ATC TTT GCT GTC ATC GCA GTT CAG CTC TTC AAG GGA AAG TTC 

l-ys Gly Lys Phe 

1300 1305 1310 1315 

TCT TAT TGC ACT GAC AGT TC^ ATA GGC 

Fne Tyr Cys Thr Asp Ser Ser Lys Asp Thr Glu Lys Glu Cys He Gly 
1320 1325 1330 

AAC TAT GTA GAT CAC GAG AAA AAC AAG ATG GAG GTG AAG GGC CGG GAA 
Asn Tyr Val Asp His Glu Lys Asn Lys Met Glu Val Lys Gly Arg Glu 
1335 1340 1345 

TGG AAG CGC CAT QAA TTC CAC TAC GAC AAC ATT ATC TGG GCC CTG CTG 
Trp Lys Arg His Glu Phe His Tyr Asp Asn He He Trp Ala Leu Leu 
1350 1355 i3gQ 

ACC CTC TTC ACC GTC TCC ACA QGG GAA GGA TGG CCT CAA GTT CTG CAG 
^ ^ ^1« Gly Trp Pro Gin Val Leu Gin 

1365 1370 1375 

CAC TCT OTA GAT GTG ACA GAG GAA GAC CGA GGC CCA AGC CGC AGC AAC 

^ Arg Gly Pro Ser Arg Ser Asn 

1380 1385 1390 1395 

CGC ATG GAG ATG TCT ATC TTT TAT GTA GTC TAC TTT GTG GTC TTC CCC 
Arg Met Glu Met Ser He Phe Tyr Val Val Tyr Phe Val Val Phe Pro 
1400 1405 1410 

TTC TTC TTT GTC AAT ATC TTT GTG GCT CTC ATC ATC ATC ACC TTC CAG 
Phe Phe Phe Val Asn He Phe Val Ala Leu He He He Thr Phe Gin 
1415 1420 1425 

GAG CAA GGG GAT AAG ATG ATG GAG GAG TGC AGC CTG GAG AAG AAT GAG 
Glu Gin Gly Asp Lys Met Met Glu Glu Cys Ser Leu Glu Lys Asn Glu 
1430 1435 1440 

AGG GCG TGC ATC GAC TTC GCC ATC AGC GCC AAA CCT CTC ACC CGC TAC 
Arg Ala Cys He Asp Phe Ala He Ser Ala Lys Pro Leu Thr Arq Tvr 
1445 1450 1455 ^ ^ 

ATG CCG CAG AAC AGA CAC ACC TTC CAG TAC CGC GTG TGG CAC TTT GTG 
Met Pro Gin Asn Arg His Thr Phe Gin Tyr Arg Val Trp His Phe Val 

1465 1470 1475 



4014 



4062 



4110 



4156 



4206 



4254 



4302 



4350 



4396 



4446 



4494 



4542 



4590 



GTG TCT CCG TCC TTT GAG TAC ACC ATT ATG GCC ATT, arr nrr Tm hbt 



A a n 
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Val Ser Pro Ser Phe Glu Tyr Thr He Met Ala Met He Ala Leu Asn 
"80 X485 1490 

ACT GTT GTG CTG ATG ATG AAG TAT TAT TCTT GCT CCC TGT ACC TAT GAG 
Thr Val Val Leu Met Met Lys Tyr Tyr Ser Ala Pro Cys mr Tyr Glu 
^^^5 1500 1505 

CTG GCC CTG AAG TAC CTG AAT ATC GCC TTC ACC ATG GTG TTT TCC CTG 
Leu Ala I-eu Lys Tyr Leu Asn He Ala Phe Thr Met Val Phe Ser Leu 
1510 1515 1520 

GAA TOT GTC CTG AAG GTC ATC GCT TTT G6C TTT TTG AAC TAT TTC CGA 
Glu Cys Val Leu Lys Val He Ala Phe Gly Phe Leu Asn Tyr Phe Arg 
1525 1530 1535 ^ ^ 

GAC ACC TGG AAT ATC TTT GAC TTC ATC ACC GTG ATT GGC AGT ATC ACA 
Asp Thr Trp Asn He Phe Asp Phe He Thr Val He Gly Ser He Thr 

1545 1550 1555 

GAA ATT ATC CTG ACA GAC AGC AAG CTG GTC AAC ACC AGT GGC TTC AAT 
Glu He He Leu Thr Asp Ser Lys Leu Val Asn Thr Ser Gly Phe Asn 
1560 1565 2.510 

ATG AGC TTT CTC AAG CTC TTC CGA GCT GCC CGC CTC ATA AAG CTC CTG 
Met Ser Phe Leu Lys Leu Phe Arg Ala Ala Arg Leu He Lys Leu Leu 
1575 1580 1585 

CGT CAG GGC TAT ACC ATA CGC ATT TTC CTC TGG ACC TTT GTC CAG TCC 
Arg Gin Gly Tyr Thr He Arg He Leu Leu Trp Thr Phe Val Gin Ser 
1590 1595 1600 

TTT AAG GCC CTC CCT TAT GTC TCC CTT TTA ATT GCC ATC CTT TTC TTC 

5*?^^* ^ ^« 11« Ala Met Leu Phe Phe 

1605 1610 1615 

ATT TAT GCC ATC ATT GGG ATC CAG GTA TTT GGA AAC ATA AAA TTA GAC 
ilt^^ ®ly ®1^ Val Phe Gly Asn He Lys Leu Asp 

1620 1625 1630 1655 

GAG GAG AGT CAC ATC AAC CGG CAC AAC AAC TTC CGG AGT TTC TTT GGG 
Glu Glu Ser His He Asn Arg His Asn Asn Phe Arg Ser Phe Phe Gly 
1640 1645 1650 

TCC CTA ATC CTA CTC TTC AGG AGT GCC ACA GGT GAG GCC TGG CAG GAG 
Ser Leu Met Leu Leu Phe Arg Ser Ala Thr Gly Glu Ala Trp Gin Glu 
1655 1660 1665 

ATT ATC CTC TCA TGC CTT GGG GAG AAG GGC TCT GAG CCT GAC ACC ACC 
He Met Leu Ser Cys Leu Gly Glu Lys Gly Cys Glu Pro Asp Thr Thr 
1670 1675 1680 

GCA CCA TCA GGG CAG AAC GAG AAT GAA CGC TCC GGC ACC GAT CTC GCC 
Ala Pro Ser Gly Gin Asn Glu Asn Glu Arg Cys Gly Thr Asp Leu Ala 
1685 1690 1695 



4686 



4734 



4782 



4830 



4878 



4926 



4974 



5022 



5070 



5118 



5166 



5214 



5262 



TAC 6TO TAC TTT GTC TCC TTC ATC TTC TTC TOP TCC TTC TTTJ UTa r-rr 
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Tyr Val Tyr Phe Val Ser Phe lie Phe Phe Cys Ser Phe Leu Met Leu 
1700 1705 1710 1715 

AAC CTG TTT GTG GCC GTC ATC ATG GAC AAC TTT GAG TAG CTG ACT CGG 5358 
Asn Leu Phe Val Ala Val lie Met Asp Asn Phe Glu Tyr Leu Thr Arg 
1720 1725 1730 

GAC TCC TCC ATC CTG GGG CCT CAC CAC TTG GAC GAG TTT GTC C6C GTC 5406 
Asp Ser Ser He Leu Gly Pro His His Leu Asp Glu Phe Val Arg Val 
1735 1740 1745 

TGQ GCA GAA TAT GAC CGA GCA GCA TGT GGC CGC ATC CAT TAC ACT GAG 5454 
Trp Ala Glu Tyr Asp Arg Ala Ala Cys Gly Arg He His Tyr Thr Glu 
1750 1755 1760 

ATG TAT GAA ATG CTG ACT CTC ATG TCA CCT CCG CTA GGC CTC GGC AAG 5502 
Met Tyr Glu Met Leu Thr Leu Met Ser Pro Pro Leu Gly Leu Gly Lys 
1765 1770 1775 

AGA TGT CCC TCC AAA GTG GCA TAT AAG AGG TTG GTC CTG ATG AAC ATG 5550 
Arg Cys Pro Ser Lys Val Ala Tyr Lys Arg lieu Val Leu Met Asn Met 
1780 1785 1790 1795 

CCA GTA GCT GAG GAC ATG ACG GTC CAC TTC ACC TCC ACA CTT ATG GCT 5598 
Pro Val Ala Glu Asp Met Thr Val His Phe Thr Ser Thr Leu Met Ala 
1800 1805 1810 

CTG ATC CGG ACA GCT CTG GAC ATT AAA ATT GCC AAA GGT GGT GCA GAC 5646 
Leu He Arg Thr Ala Leu Asp He Lys He Ala Lys Gly Gly Ala Asp 
1815 1820 1825 

AGG CAG CAG CTA GAC TCA GAG CTA CAA AAG GAG ACC CTA GCC ATC TGG 5694 
Arg Gin Gin Leu Asp Ser Glu Leu Gin Lys Glu Thr Leu Ala He Trp 
1830 1835 1840 

CCT CAC CTA TCC CAG AAG ATG CTG GAT CTG CTT GTG CCC ATG CCC AAA 5742 
Pro His Leu Ser Gin Lys Met Leu Asp Leu Leu Val Pro Met Pro Lys 
1845 1850 1855 

GCC TCT GAC CTG ACT GTG GGC AAA ATC TAT GCA GCA ATG ATG ATC ATG 5790 
Ala Ser Asp Leu Thr Val Gly Lys He Tyr Ala Ala Met Met He Met 
I860 1865 1870 1875 

GAC TAC TAT AAG CAG AGT AAG GTG AAG AAG CAG AGG CAG CAG CTG GAG 5838 
Asp Tyr Tyr Lys Gin Ser Lys Val Lys Lys Gin Arg Gin Gin Leu Glu 
1880 1885 1890 

GAA CAG AAA AAT GCC CCC ATG TTC CAG CGC ATG GAG CCT TCA TCT CTG 5886 
Glu Gin Lys Asn Ala Pro Met Phe Gin Arg Met Glu Pro Ser Ser Leu 
1895 1900 1905 

CCT CAG GAG ATC ATT GCT AAT GCC AAA GCC CTG CCT TAC CTC CAG CAG 5934 
Pro Gin Glu He He Ala Asn Ala Lys Ala Leu Pro Tyr Leu Gin Gin 
1910 1915 1920 

GAC CCC GTT TCA GGC CTG AGT GGC CGG AGT GGA TAC CCT TCG ATG AGT 5982 
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Asp Pro Val Ser 6ly Leu Ser Gly Arg Ser Gly Tyr Pro Ser Met Ser 
1925 1930 1935 

CCA CTC TCT CCC CA6 GAT ATA TTC CAG TTG GCT TGT ATG GAC CCC GCC 6030 
Pro Leu Ser Pro Gin Asp lie Phe Gin lieu Ala Cys Met Asp Pro Ala 
1940 1945 1950 1955 

GAT GAC GGA CAG TTC CAA GAA CGG CAG TCT CTG 6TG GTG ACA GAC CCT 6078 
Asp Asp Gly Gin Phe Gin Glu Arg Gin Ser Leu Val Val Thr Asp Pro 
1960 1965 1970 

AGC TCC ATG AGA CGT TCA TTT TCC ACT ATT CGG GAT AAG CGT TCA AAT 6126 
Ser Ser Met Arg Arg Ser Phe Ser Thr lie Arg Asp Lys Arg Ser Asn 
1975 1980 1985 

TCC TCG TGG TTG GAG GAA TTC TCC ATG GAG CGA AGC AGT GAA AAT ACC 6174 
Ser Ser Trp Leu Glu Glu Phe Ser Met Glu Arg Ser ser Glu Asn Thr 
1990 1995 2000 

TAC AAG TCC CGT CGC CGG AGT TAG CAC TCC TCC TTG CGG CTG TCA GCC 6222 
Tyr Lys Ser Arg Arg Arg Ser Tyr His Ser Ser Leu Arg Leu Ser Ala 
2005 2010 2015 

CAC CGC CTG AAC TCT GAT TCA GGC CAC AAG TCT GAC ACT CAC CCC TCA 6270 
His Arg Leu Asn Ser Asp Ser Gly His Lys Ser Asp Thr His Pro Ser 
2020 2025 2030 2035 

GGG GGC AGG GAG CGG CGA CGA TCA AAA GAG CGA AAG CAT CTT CTC TCT 631B 
Gly Gly Arg Glu Arg Arg Arg Ser Lys Glu Arg Lys His lieu Leu Ser 
2040 2045 2050 

CCT GAT GTC TCC CGC TGC AAT TCA GAA GAG CGA GGG ACC CAG GCT GAC 636(5 
Pro Asp Val Ser Arg Cys Asn Ser Glu Glu Arg Gly Thr Gin Ala Asp 
2055 2060 2065 

TGG GAG TCC CCA GAG CGC CGT CAA TCC AGG TCA CCC AGT GAG GGC AGG 6414 
Trp Glu Ser Pro Glu Arg Arg Gin Ser Arg Ser Pro Ser Glu Gly Arg 
2070 2075 2080 

TCA CAG ACQ CCC AAC AGA CAG GGC ACA GGT TCC CTA AGT GAG AGC TCC 6462 
Ser Oln Thr Pro Asn Arg Gin Gly Thr Gly Ser Leu Ser Glu Ser Ser 
2065 2090 2095 

ATC CCC TCT ore TCT GAC ACC AGC ACC CCA AGA AGA AGT CGT CGG CAG 6510 
lie Pro Ser Val Ser Asp Thr Ser Thr Pro Arg Arg Ser Arg Arg Gin 
2100 2105 2110 2115 

CTC CCA CCC GTC CCG CCA AAG CCC CGG CCC CTC CTT TCC TAC AGC TCC 6558 
Leu Pro Pro Val Pro Pro Lys Pro Arg Pro Leu Leu Ser Tyr Ser Ser 
2120 2125 2130 

CTG ATT CGA CAC GCG GGC AGC ATC TCT CCA CCT GCT GAT GGA AGC GAG 6606 
Leu lie Arg His Ala Gly Ser He Ser Pro Pro Ala Asp Gly Ser Glu 
2135 2140 2145 

GAG GGC TCC CCG CTG ACC TCC CAA GCT CTG GAG AGC AAC AAT GCT TGG 6654 
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Glu Gly Ser Pro l,eu Thr Ser Gin Ala Leu Glu Ser Asn Asn Ala Trp 
2150 2155 2160 

CTG ACC GAG TCT TCC AAC TCT CCG CAC CCC CAG CAG AGG CAA CAT GCC 6702 
Leu Thr Glu Ser Ser Asn Ser Pro His Pro Gin Gin Arg Gin His Ala 
2165 2170 2175 

TCC CCA CAG C6C TAC ATC TCC GAG CCC TAC TTG GCC CTG CAC GAA GAC 6750 
Ser Pro Gin Arg Tyr lie Ser Glu Pro Tyr Leu Ala Leu His Glu Asp 
2180 2185 2190 2195 

TCC CAC GCC TCA GAC TGT GTT GAG GAG GAG ACG CTC ACT TTC GAA GCA 6798 
Ser His Ala Ser Asp Cys Val Glu Glu Glu Thr Leu Thr Phe Glu Ala 
2200 2205 2210 

GCC GTG GCT ACT AGC CTG GGC CGT TCC AAC ACC ATC GGC TCA GCC CCA 6846 
Ala Val Ala Thr Ser Leu Gly Arg Ser Asn Thr He Gly Ser Ala Pro 
2215 2220 2225 

CCC CTG CGG CAT AGC TGG CAG ATG CCC AAC GGG CAC TAT CGG CGG CGG 6894 
Pro Leu Arg His Ser Tip Gin Met Pro Asn Gly His Tyr Arg Arg Arg 
2230 2235 2240 

AGG CGC GGG GGG CCT GGG CCA GGC ATG ATG TGT GGG GCT GTC AAC AAC 6942 
Arg Arg Gly Gly Pro Gly Pro Gly Met Met Cys Gly Ala Val Asn Asn 
2245 2250 2255 

CTG CTA AGT GAC ACG GAA GAA GAT GAC AAA TGC TAGAGGCTGC TCCCCCCTCC 6995 
Leu Leu Ser Asp Thr Glu Glu Asp Asp Lys Cys 
2260 2265 2270 

GATGCATGCT CTTCTCTCAC ATGGAGAAAA CCAAGACAGA ATTGGGAAGC CAGTGCGGCC 7055 
CCGCGGGG2U3 GAAGAGGQAA AAGGAAGATG GAAG 7089 
(2) INFORMATION FOR SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2634 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDH6SS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1983 

(D) OTHER INFORMATION: /standard_ziainei= "Beta-2d" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

ATG GTC CAA AGG GAC ATG TCC AAG TCT CCT CCC ACA CCG GCG GCG GCG 48 
Met Val Gin Arg Asp Met Ser Lys Ser Pro Pro Thr Pro Ala Ala Ma 
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"^5 10 15 

GTG QCG CAG GAG ATC CM AT6 GAA CTG CTA GAG AAC GTG GCT CCC GCG 96 
Val Ala Gin Glu lie Gin Met Glu Leu Leu Glu Asn Val Ala Pro Ala 
20 25 30 

GOG GCG CTC GGA GCC GCC GCA CAG TCA TAT GGA AAA GGA GCC AGA AGG 144 
Gly Ala Leu Gly Ala Ala Ala Gin Ser Tyr Gly Lys Gly Ala Arg Axg 
35 40 45 

AAA AAC AGA TTT AAA GGA TCT GAT GGA AGC ACG TCA TCT GAT ACT ACC 192 
Lys Afin Arg Phe Lys Gly Ser Asp Gly Ser Thr Ser Ser Asp Thr Thr 
50 55 go 

TCA AAT AGT TTT GTT CGC CAG GGT TCG GCA GAC TCC TAG ACT AGC CGT 240 
Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Asp Ser Tyr Thr Ser Arg 

70 75 80 

CCA TCC GAT TCC GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG CGC 288 
Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg 
B5 90 95 

AGA GAA GCG GAG CGG CAG GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA 336 
Arg Glu Ala Glu Arg Gin Ala Gin Ala Gin Leu Glu Lys Ala Lys Thr 
100 105 no 

AAG CCC GTT GCA TTT GCG GTT CGG ACA AAT GTC AGC TAG AGT GCG GCC 384 
Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala 
115 120 125 

CAT GAA GAT GAT GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA 432 
His Glu Asp Asp Val Pro Val Pro Gly Met Ala lie Ser Phe Glu Ala 
130 135 140 

AAA GAT TTT CTG CAT GTT AAG GAA AAA TTT AAC AAT GAC TGG TGG ATA 480 
Lys Asp Phe Leu His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp lie 
145 150 155 160 

GGG CGA TTG GTA AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA 528 
Gly Arg Leu Val Lys Glu Gly Cys Glu He Gly Phe He Pro Ser Pro 
165 170 175 

GTC AAA CTA GAA AAC ATG AGG CTG CAG CAT GAA CAG AGA GCC AAG CAA 576 
Val Lys Leu Glu Asn Met Arg Leu Gin His Glu Gin Arg Ala Lys Gin 
180 185 190 

GGG AAA TTC TAC TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG 624 
Gly Lys Phe Tyr Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu 
195 200 205 

GGT GAC ATA GTA CCT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT GCT 672 
Gly Asp He Val Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala 
210 215 220 

ATA GAC ATA GAT GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA 720 
He Asp He Asp Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp He Pro 
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225 230 235 



240 



GCA AAC CAC CGC TCC CCT AAA CCC AST GCA AAC AGT GTA ACG TCA CCC 768 
Ala Asn His Arg Ser Pro Lys Pro Ser Ala Asn Ser Val Thr Ser Pro 
245 250 255 

CAC TCC AAA OAG AAA AGA ATG CCC TTC TTT AAG AA6 ACA GAG CAC ACT 816 
His Ser Lys Glu Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr 
260 265 270 

CCT CCG TAT GAT GTG GTA CCT TCC ATG CGA CCA GTG GTC CTA GT6 GGC 864 
Pro Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val Val Leu Val Gly 
275 280 285 

CCT TCT CTG AAG GGC TAC GAG GTC ACA GAT ATG ATG CAA AAA GCG CTG 912 
Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gin Lys Ala Leu 
290 295 300 

TTT GAT TTT TTA AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG 960 
Phe Asp Phe Leu Lys His Arg Phe Glu Gly Arg lie Ser lie Thr Arg 
305 310 315 320 

GTC ACC GCT GAC ATC TCG CTT GCC AAA CGC TCG GTA TTA AAC AAT CCC 1008 
Val Thr Ala Asp lie Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro 
325 330 335 

AGT AAG CAC GCA ATA ATA GAA AGA TCC AAC ACA AGG TCA AGC TTA GCG 1056 
Ser Lys His Ala lie lie Glu Arg Ser Asn Thr Aarg Ser Ser Leu Ala 
340 345 350 

GAA GTT CAG AGT GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG 1104 
Glu Val Gin Ser Glu lie Glu Arg He Phe Glu Leu Ala Arg Thr Leu 
355 360 365 

CAG TTG GTG GTC CTT GAC GCG GAT ACA ATT AAT CAT CCA GCT CAA CTC 1152 
Gin Leu Val Val Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu 
370 375 380 

AGT AAA ACC TCC TTG GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT 1200 
Ser Lys Thr Ser Leu Ala Pro He He Val Tyr Val Lys He Ser Ser 
385 390 39S 400 

CCT AAG GTT TTA CAA AGG TTA ATA AAA TCT CGA GGG AAA TCT CAA OCT 1248 
Pro Lys Val Leu Gin Arg Leu He Lys Ser Arg Gly Lys Ser Gin Ala 
405 410 415 

AAA CAC CTC AAC GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT 1296 
Lys His Leu Asn Val Gin Met Val Ala Ala Asp Lys Leu Ala Gin Cys 
420 425 430 

CCT CCA GAG CTG TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT 1344 
Pro Pro Glu Leu Phe Asp Val He Leu Asp Glu Asn Gin Leu Glu Asp 
435 440 445 

GCC TOT GAG CAC CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC 1392 
Ala Cys Glu His Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr 
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455 460 

CAT CCT CCC AGC AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA 1440 
His Pro Pro Ser Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu 

470 475 480 

GCC ACT TCA AGT CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG 1488 
Ala Thr Ser Ser Leu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gin 
485 490 495 

GOT TCT CAA GGT GAT CAS AGG ACT GAT CGC TCC GOT CCT ATC CGT TCT 1536 
Gly Ser Gin Gly Asp Gin Arg Thr Asp Arg Ser Ala Pro He Arg Ser 
500 505 510 

GCT TCC CAA GCT GAA GAA GAA CCT AGT GTG GAA CCA GTC AAG AAA TCC 1584 
Ala Ser Gin Ala Glu Glu Qlu Pro Ser Val Glu Pro Val Lys Lys Ser 
515 520 525 

»G CAC CGC TCT TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT GGG 1632 
Gin Hxs Arg Ser Ser Ser Ser Ala Pro His His Asn His Arg Ser Glv 
530 535 540 

ACA AGT CGC GGC CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG 1680 
Thr Ser Arg Gly Leu Ser Arg Gin Glu Thr Phe Asp Ser Glu Thr Gin 
545 550 555 560 

GAG AGT CGA GAC TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT 1728 
Glu Ser Arg Asp Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His 
565 570 575 

GAC CAC GTG GAC CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC AGA GAC 1776 
Asp His Val Asp His Tyr Ala Ser His Arg Asp His Asn His Arg Asp 
580 585 590 

GAG ACC CAC GGG AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT 1824 
Glu Thr His Gly Ser Ser Asp His Arg His Arg Glu Ser Arg His Arg 
595 600 605 

TCC CGG GAC GTG GAT CGA GAG CAG GAC CAC AAC GAG TGC AAC AAG CAG 1872 
Ser Arg Asp Val Asp Arg Glu Gin Asp His Asn Glu Cys Asn Lys Gin 
CIO 615 620 

CGC AGC CGT CAT AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA 1920 
Arg Ser Arg His Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu 
625 630 635 640 

GTG ATA TCA AAA AAA CGG AAT GAG GCT GGG GAG TOG AAC AGG GAT GTT 1968 
Val He Ser Lys Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val 
645 650 655 

TAC ATC CCC CAA TGAGTTTTGC CCTTTTGTGT T T m ' rmT TTTTTTTTGA 2020 
Tyr He Pro Gin 
660 

AGTCTTGTAT AACTAACAGC ATCCCCAAAA CAAAAAGTCT TTGGGGTCTA CACTCCAATC 2080 
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ATATGTGATC TGTCTTGTAA TATTTTGTAT TATTGCTGTT GCTTGAATAG CAATAGCATG 2140 

GATAGAGTAT TGAGATACTT TTTCTTTTGT AAGTGCTACA TAAATTGGCC TGGTATGGCT 2200 

GCAGTCCTCC GGTTGCATAC TGGACTCTTC AAAAACTGTT TTGGGTAGCT GCCACTTGAA 2260 

CAAAATCTGT TGCCACCCAG GTGATGTTAG TGTTTTAAGA AATGTAGTTG ATGTATCCAA 2320 

CAAGCCAGAA TCAGCACAGA TAAAAAGTGG AATTTCTT6T TTCTCCAGAT TTTTAATACG 2380 

TTAATACGCA GGCATCTGAT TTGCATATTC ATTCATGGAC CACTGTTTCT TGCTTGTACC 2440 

TCTGGCTGAC TAAATTTGGG GACAGATTCA GTCTTGCCTT ACACAAAGGG GATCATAAAG 2500 

TTAGAATCTA TTTTCTATGT ACTAGTACTG TGTACTGTAT AGACTIGTTTG TAAATGTTAT 2560 

TTCTGCAAAC AAACACCTCC TTATTATATA TAATATATAT ATATATATCA GTTTGATCAC 2620 

ACTATTTTAG AGTC 2634 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLBCDLE TYPE: DNA (genomic) 

(ix) FEATDRE: 

(A) NAME/KEY: CDS 

(B) LOCAT ION: 69.. 1631 

(D) OTHER INFORMATION: /standard_ziame« "Beta-4" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

AGCCCA6CCT CGGGGGCCAG CCCCCTCC6C CCACCGCACA CGGGCTGGCC ATGCGGCGGC 60 

TCTGAACG ATG TCC TCC TCC TCC TAC GCC AAG AAC GGG ACC GCG GAC GGG 110 
Met Ser Ser Ser Ser Tyr Ala Lys Asn Gly Thr Ala Asp Gly 
15 10 

CCG CAC TCC CCC ACC TCG CAG GTG GCC CGA GGC ACC ACA ACC CGG AGG 158 
Pro His Ser Pro Thr Ser Gin Val Ala Arg Gly Thr Thr Thr Arg Arg 
15 20 25 30 

AGC AGG TTG AAA AGA TCC GAT GGC AGC ACC ACT TCG ACC AGC TTC ATC 206 
Ser Arg Leu Lys Arg Ser Asp Gly Ser Thr Thr Ser Thr Ser Phe lie 
35 40 45 

CTC AGA CAG GGT TCA GCO GAT TCC TAC ACA AGC AGG CCG TCT GAC TCC 254 
Leu Arg Gin Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser 
50 55 60 
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GAT GTC TCT TTG GAA GAG GAC CGG GAA GCA ATT CGA CAG GAG AGA GAA 302 
Asp Val Ser Leu Glu Qlu Asp Arg Glu Ala lie Arg Gin 61u Arg Glu 
65 70 75 

CAG CAA GCA GCT ATC CAG CTT GAG AGA GCA AAG TCC AAA CCT GTA GCA 350 
Gin Gin Ala Ala He Gin Leu Glu Arg Ala Lys Ser Lys Pro Val Ala 
80 85 90 

TTT GCC GTG AAG ACA AAT GTG AGC TAG TGC GGC 6CC CTG GAC GAG GAT 396 
Phe Ala Val Lys Thr Asn Val Ser Tyr Cys Gly Ala Leu Asp Glu Asp 
95 100 105 110 

GTG CCT GTT CCA AGC ACA GCT ATC TCC TTT GAT GCT AAA GAC TTT CTA 446 
Val Pro Val Pro Ser Thr Ala He Ser Phe Asp Ala Lys Asp Phe Leu 
115 120 125 

CAT ATT AAA GAG AAA TAT AAC AAT GAT TGG TGS ATA GGA AGG CTG GTG 494 
His He Lys Glu Lys Tyr Asn Asn Asp Trp Trp lie Gly Arg Leu Val 
130 135 140 

AAA GAG GGC TGT GAA ATT GGC TTC ATT CCA AGT CCA CTC AGA TTG GAG 542 
Lys Glu Gly Cys Glu He Gly Phe He Pro Ser Pro Leu Arg Leu Glu 
145 150 155 

AAC ATA CGG ATC CAG CAA GAA CAA AAA AGA GGA CGT TTT CAC GGA GGG 590 
Asn He Arg He Gin Gin Glu Gin Lys Arg Gly Arg Phe His Gly Gly 
160 165 170 

AAA TCA AGT GGA AAT TCT TCT TCA AGT CTT GGA GAA ATG GTA TCT GGG 638 
Lys Ser Ser Gly Asn Ser Ser Ser Ser Leu Gly Glu Net Val Ser Gly 
175 180 185 190 

ACA TTC CGA GCA ACT CCC ACA TCA ACA GCA AAA CAG AAG CAA AAA GTG 686 
Thr Phe Arg Ala Thr Pro Thr Ser Thr Ala Lys Gin Lys Gin Lys Val 
195 200 205 

ACG GAG CAC ATT CCT CCT TAC GAT GTT GTA COG TCA ATG CGT CCG GTG 734 
Thr Glu His He Pro Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val 
210 215 220 

GTG TTA GTG GGG CCG TCA CTG AAA GGT TAC GAG GTA ACA GAC ATG ATG 782 
Val Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met 
225 230 235 

CAG AAA GCC CTC TTT GAT TCC CTG AAG CAC AGG TTT GAT GGG AGG ATT 830 
Gin Lys Ala Leu Phe Asp Ser Leu Lys His Arg Phe Asp Gly Arg He 
240 245 250 

TCA ATA ACG AGA GTG ACA GCT GAC ATT TCT CTT GCT AAG AGG TCT GTC 878 
Ser He Thr Arg Val Thr Ala Asp He Ser Leu Ala Lys Arg Ser Val 
255 260 265 270 

CTA AAT AAT CCC AGC AAG AGA GCA ATA ATT GAA CGT TCG AAC ACC CGG ' 926 
Leu Asn Asn Pro Ser Lys Arg Ala He He Glu Arg Ser Asn Thr Arg 
275 280 285 
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TCC AGC TTA GCG GAA GTA CAA AGT GAA ATT GAA AGA ATC TTT GAG TTG 974 
Ser Ser Leu Ala Glu Val Gin Ser Glu He Glu Arg He Phe Glu Tjeu 
290 295 300 

GOl AGA TCT TTG CAA CTG GTT GTT CTT GAT GCA GAG ACC ATC AAT CAC 1022 
Ala Arg Ser Leu Gin Leu Val Val Leu Asp Ala Asp Thr He Asn His 
305 310 315 

CCA GCA CAA CTT ATA AAG ACT TCC TTA GCA CCA ATT ATT GTT CAT GTA 1070 
Pro Ala Gin Leu He Lys Thr Ser Leu Ala Pro He He Val His Val 
320 325 330 

AAA GTC TCA TCT CCA AAG GTT TTA CAG CGG TTG ATT AAA TCT AGA GGA 1118 
Lys Val Ser Ser Pro Lys Val Leu Gin Arg Leu He Lys Ser Arg Gly 
335 340 345 350 

AAG TCA CAA AGT AAA CAC TTG AAT GTT CAA CTG GTG GCA GCT GAT AAA 1166 
Lys Ser Gin Ser Lys His Leu Asn Val Gin Leu Val Ala Ala Asp Lys 
355 360 365 

CTT GCA CAA TGC CCC CCA GAA ATG TTT GAT GTT ATA TTG GAT GAA AAT 1214 
Leu Ala Gin Cys Pro Pro Glu Met Phe Asp Val He Leu Asp Glu Asn 
370 375 380 

CAG CTT GAG GAT GCA TGT GAA CAT CTA GGG GAG TAC CTG GAG GCG TAC 1262 
Gin Leu Glu Asp Ala Cys Glu His Leu Gly Glu Tyr Leu Glu Ala Tyr 
385 390 395 

TGG CGT GCC ACC CAC ACA ACC AGT AGC ACA CCC ATG ACC CCG CTG CTG 1310 
Trp Arg Ala Thr His Thr Thr Ser Ser Hir Pro Met Thr Pro Leu Leu 
400 405 410 

GGA AGG AAT TTG GGC TCC ACG GCA CTC TCA CCA TAT CCC ACA GCA ATT 1358 
Gly Arg Asn Leu Gly Ser Thr Ala Leu Ser Pro Tyr Pro Thr Ala He 
415 420 425 430 

TCT GGG TTA CAG AGT CAG CGA ATG AGG CAC AGC AAC CAC TCC ACA GAG 1406 
Ser Gly Leu Gin Ser Gin Arg Met Arg His Ser Asn His Ser Thr Glu 
435 440 445 

AAC TCT CCA ATT GAA AGA CGA AGT CTA ATG ACC TCT GAT GAA AAT TAT 1454 
Asn Ser Pro He Glu Arg Arg Ser Leu Met Thr Ser Asp Glu Asn Tyr 
450 455 460 

CAC AAT GAA AGG GCT CGG AAG AGT AGG AAC CGC TTG TCT TCC AGT TCT 1502 
His Asn Glu Arg Ala Arg Lys Ser Arg Asn Arg Leu Ser Ser Ser Ser 
465 470 475 

CAG CAT AGC CGA GAT CAT TAC CCT CTT GTG GAA GAA GAT TAC CCT GAC 1550 
Gin His Ser Arg Asp His Tyr Pro Leu Val Glu Glu Asp Tyr Pro Asp 
480 485 490 

TCA TAC CAG GAC ACT TAC AAA CCC CAT AGG AAC CGA GGA TCA CCT GGG 1598 
Ser Tyr Gin Asp Thr Tyr Lys Pro His Arg Asn Arg Gly Ser Pro Gly 
495 500 505 510 
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GGA TAT AGC CAT OAC TCC CGA CAT AGG CTT TGAC5TCTAAT GAAACAAAAA 1648 
Gly Tyr Ser His Asp Ser Arg His Arg Leu ib48 
515 520 

ATATTCATCT GTTGACAATT TGCCATAGCA GTGCTAGGAT AAACCAATCA TCTTAACTTG 1708 

GCTAACATAG CACAGTATTT ACTGTGCTAA TGGGCTGCTG TCATTTTATG CTAAGTAAGG 1768 

GGCAAAAAAA AAAATTACAT TATGCCCTTG AOTCTAGATG GATATTAQAT GCCCG 1823 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Ser Ser Ser Ser Tyr Ala Lys Asn Gly Thr Ala Asp Gly Pro His 

^ 5 10 15 

Ser Pro Thr Ser Gin Val Ala Arg Gly Thr Thr Thr Arg Arg Ser Arg 
20 25 30 

Leu Lys Arg Ser Asp Gly Ser Thr Thr Ser Thr Ser Phe He Leu Ara 
35 40 45 

Gin Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser Asp Val 

50 55 60 . 

Ser Leu Glu Glu Asp Arg Glu Ala He Arg Gin Glu Arg Glu Gin Gin 
^5 70 75 80 

Ala Ala He Gin Leu Glu Arg Ala Lys Ser Lys Pro Val Ala Phe Ala 
85 90 95 

Val Lys Thr Asn Val Ser Tyr Cys Gly Ala Leu Asp Glu Asp Val Pro 
100 105 110 

Val Pro Ser Thr Ala He Ser Phe Asp Ala Lys Asp Phe Leu His He 
115 120 125 

Lys Glu Lys Tyr Asn Asn Asp Trp Trp He Gly Arg Leu Val Lys Glu 
130 135 140 

Gly Cys Glu He Gly Phe He Pro Ser Pro Leu Arg Leu Glu Asn He 
145. 150 155 160 

Arg He Gin Gin Glu Gin Lys Arg Gly Arg Phe His Gly Gly Lys Ser 
165 170 175 

Ser Gly Asn Ser Ser Ser Ser Leu Gly Glu Met Val Ser Gly Thr Phe 
180 185 190 
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Arg Ala Thr Pro Thr Ser Thr Ala Lys Gin Lys Gin Lys Val Thr Glu 
195 200 205 

His He Pro Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val Val Leu 
210 215 220 

Val Gly Pro Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gin Lys 
225 230 235 240 

Ala Leu Phe Asp Ser Leu Lys His Arg Phe Asp Gly Arg He Ser He 
245 250 255 

Thr Arg Val Thr Ala Asp He Ser Leu Ala Lys Arg Ser Val Leu Asn 
260 265 270 

Asn Pro Ser Lys Arg Ala He He Glu Arg Ser Asn Thr Arg Ser Ser 
275 280 285 

Leu Ala Glu Val Gin Ser Glu He Glu Arg He Phe Glu Leu Ala Arg 
290 295 300 

Ser Leu Gin Leu Val Val Leu Asp Ala Asp Thr He Asn His Pro Ala 
305 310 315 320 

Gin Leu He Lys Thr Ser Leu Ala Pro He He Val His Val Lys Val 
325 330 335 

Ser Ser Pro Lys Val Leu Gin Arg Leu He Lys Ser Arg Gly Lys Ser 
340 345 350 

Gin Ser Lys His Leu Asn val Gin Leu Val Ala Ala Asp Lys Leu Ala 
355 360 365 

Gin Cys Pro Pro Glu Met Phe Asp Val He Leu Asp Glu Asn Gin Leu 

370 375 380 

Glu Asp Ala Cys Glu His Leu Gly Glu Tyr Leu Glu Ala Tyr Trp Arg 
385 390 395 400 

Ala Thr His Thr Thr Ser Ser Thr Pro Met Thr Pro Leu Leu Gly Arg 
405 410 415 

Asn Leu Gly Ser Thr Ala Leu Ser Pro Tyr Pro Thr Ala He Ser Gly 
420 425 430 

Leu Gin Ser Gin Arg Met Arg His Ser Asn His Ser Thr Glu Asn Ser 
435 440 445 

Pro He Glu Arg Arg Ser Leu Met Thr Ser Asp Glu Asn Tyr His Asn 
450 455 460 

Glu Arg Ala Arg Lys Ser Arg Asn Arg Leu Ser Ser Ser Ser Gin His 
465 470 475 480 



Ser Arg Asp His Tyr Pro Leu Val Glu Glu Asp Tyr Pro Asp Ser Tyr 
485 490 495 
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Gin Asp Thr Tyr Lys Pro His Arg Asn Arg Gly Ser Pro Gly Gly Tyr 
500 505 510 

Ser His Asp Ser Arg His Arg Leu 
515 520 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



Cix) P3SATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 35. .3346 

(D) OTHER INFORMATION: /standard__name= "Alpha-2a" 

(ix) FEATURE: 

(A) MAMB/KBY: 5'UTR 

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3347.-3636 



(xi) SEQUENC!E DESCRIPTION: SEQ ID NO: 29: 

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG OCT OCT GGC TGC CT6 

Net Ala Ala Gly Cys Leu 
1 5 

CTG C3CC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 
Leu Ala Leu Thr Leu Thr Leu Pbe Gin Ser Leu Leu lie Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TOG GTG GAT 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr He Lys Ser Trp Val Asp 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG 6CA AAA ACA GCA AGT GGA GTC 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 
Asn Gin Leu Val Asp He Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu He Ala Ala Arg Asp He 
75 80 85 
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GAG AAA CTT CTG AGC AAC AGA TCT JAA 6CC CTG GTG AGC CTG GCA TTG 340 
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 
90 95 100 

GAA QCG GAG AAA GTT CAA GCA GCT CAC CA6 TGG AGA GAA GAT TTT GCA 388 
Glu Ala Glu Lys Val Gin Ala Ala His Gin Tzp Arg Glu Asp Phe Ala 
105 110 115 

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 436 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AG6 ATA AAA CCT GTT TTC ATT 484 
Lys Asn Asp Ser Glu Pro 61y Ser Gin Arg He Lys Pro Val Phe He 
135 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532 
Glu Asp Ala Asn Phe Gly Arg Gin He Ser Tyr Gin His Ala Ala Val 
155 160 165 

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580 
His He Pro Thr Asp He Tyr Glu Gly Ser Thr He Val Leu Asn Olu 

170 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 626 
I<eu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 
185 190 195 

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676 
Glu Asp Pro Ser Leu Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu 
200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772 
Asn Lys He Asp Leu Tyr Asp Val Arg Arg Arg Pro Tirp Tyr He Gin 
235 240 245 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 820 
Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Gly 
250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT , 916 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 964 
Asn Ser Asn Ala Gin Tksp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 
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AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012 
Asn Val Arg Asn Lys Lys Val lieu Lys Asp Ala Val Asn Asn He Thr 

315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060 
Ala Lys Gly He Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT HOB 
Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 
360 365 370 

AAA TAG AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 1204 
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 
375 380 385 390 

CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 1252 
Gin His Asn Tyr Glu Arg Gly Pro He Gin Trp Met Ala Cys Glu Asn 
395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300 
Lys Gly Tyr Tyr Tyr Glu He Pro Ser He Gly Ala He Arg He Asn 
410 415 420 

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 1348 
Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly 
425 430 435 

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 1396 
Asp Lys Ala Lys Gin Val Gin T3:p Thr Asn Val Tyr Leu Asp Ala Leu 
440 445 450 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CC6 GTC TTC AAC ATA ACC 1444 
Glu Leu Gly Leu Val He Thr Gly Thr Leu Pro Val Phe Asn He Thr 
455 460 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 1492 
Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly 
475 480 485 

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT MA AGA CTG ACA CCA 1540 
val Met Gly Val Asp Val Ser Leu Glu Asp He Lys Arg Leu Thr Pro 
490 495 500 

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala He Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG CCT ATT GGT GTA 1636 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys Pro He Gly Val 
520 525 530 
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6GT ATA CCA ACA ATT AAT TTA AGA AAA A6G AGA CCC AAT ATC CAQ AAC 1684 
Qly lie Pro Thr lie Asn Leu Arg Lys Arg Arg Pro Asn lie Gin Asn 
535 540 545 550 

CCC AAA TCT CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA 1732 
Pro Lys Ser Gin Glu Pro Val Thr Leu Asp Phe Leu Asp Ala Glu Leu 
555 560 565 

GAG AAT GAT ATT AM GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA 1780 
Glu Asn Asp lie Lys Val Glu lie Arg Asn Lys Net lie Asp Gly Glu 

570 575 580 

AGT GGA GAA AAA ACA TTC AGA ACT CTG GTT AAA TCT CAA GAT GAG AGA 1828 
Ser Gly Glu Lys Thr Phe Arg Thr Leu Val Lys Ser Gin Asp Glu Arg 
585 590 595 

TAT ATT GAG AAA GGA AAC AGG ACA TAG ACA TGG ACA CCT GTC AAT GGC 1876 
Tyr lie Asp Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly 
600 605 610 

ACA GAT TAC AGT TTG GCC TTG GTA TTA CCA ACC TAC AGT TTT TAG TAT 1924 
Thr Asp Tyr Ser Leu Ala Leu Val Leu Pro Thr Tyr Ser Phe Tyr Tyr 
6X5 620 625 630 

ATA AAA GCC AAA CTA GAA GAS ACA ATA ACT CAG GCC AGA TAT TCG GAA 1972 
lie Lys Ala Lys Leu Glu Glu Thr lie Thr Gin Ala Arg Tyr Ser Glu 
635 640 645 

ACC CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA 2020 
Thr Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe He Ala 
650 655 660 

CCA AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA 2066 
Pro Arg Asp Tyr Cys Asn Asp Leu Lys He Ser Asp Asn Asn Thr Glu 
665 670 675 

TTT CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC 2116 
Phe Leu Leu Asn Phe Asn Glu Phe He Asp Arg Lys Thr Pro Asn Asn 
680 685 690 

CCA TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT OCA GGC 2164 
Pro Ser Cys Asn Ala Asp Leu He Asn Arg Val Leu Leu Asp Ala Gly 
695 700 705 710 

TTT ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT ATC 2212 
Phe Thr Asn Glu Leu Val Gin Asn Tyr Trp Ser Lys Gin Lys Asn He 
715 720 725 

AAG GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA 2260 
Lys Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly Gly He Thr Arg 
730 735 740 

GTT TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA 2308 
Val Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr 
745 750 755 
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TAT GAG GAC AGC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT 2356 
Tyr Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val 
760 765 770 

TTC ACT GCT CCC TAG TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG 2404 
Phe Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser 
775 780 785 790 

GGC ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT 2452 
Gly lie Met Val Ser Lys Ala Val Glu He Tyr He Gin Gly Lys Leu 
795 800 805 

CTT AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA 2500 
lieu Lys Pro Ala Val Val Gly He Lys He Asp Val Asn Ser Trp He 
810 815 620 

GAG AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT 2548 
Glu Asn Phe Thr Lys Thr Ser He Arg Asp Pro Cys Ala Gly Pro Val 
825 830 835 

TGT GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT 2596 
Cys Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys Val He Leu Asp 
840 845 850 

GAT GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CA6 2644 
Asp Gly Gly Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gin 
'855 860 865 870 

ATT GGA AGA TTT TTT GGA GAG ATT GAT CCC AGC TTG ATG AGA CAC CTG 2692 
He Gly Arg Phe Phe Gly Glu He Asp Pro Ser Leu Met Arg His Leu 
875 880 885 

GTT AAT ATA TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA 2740 
Val Asn He Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser 
890 895 900 

GTA TGT GAG CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA 2788 
Val Cys Glu Pro Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arg Ser 
905 910 915 

GCA TAT GTG CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC 2836 
Ala Tyr Val Pro Ser Val Ala Asp He Leu Gin He Gly Trp Trp Ala 
920 925 930 

ACT GCT GCT GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC 2884 
Thr Ala Ala Ala Trp Ser He Leu Gin Gin Phe Leu Leu Ser Leu Thr 
935 940 945 950 

TTT CCA CGA CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG 2932 
Phe Pro Arg Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr 
955 960 965 

GCC TCC CTG TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC 2980 
Ala Ser Leu Ser Lys Gin Ser Cys He Thr Glu Gin Thr Gin Tyr Phe 
970 975 980 
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TTC GAT AAC GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC 3028 
Phe Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn 
985 990 995 

TGT TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA 3076 
Cys Ser Arg He Phe His Gly Glu Lys Leu Met Asn Thr Asn Leu He 
1000 1005 1010 

TTC ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG 3124 
Phe He Met Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu 
1015 1020 1025 1030 

CTC ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG 3172 
Leu He Gin Ala Glu Gin Thr Ser Asp Gly Pro Asn Pro Cys Asp Met 
1035 1040 1045 

GTT AAG CAA CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC 3220 
Val Lys Gin Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn 
1050 1055 1060 

AAT GTC TTG GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT 3268 
Asn Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn 
1065 1070 1075 

CCC TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG 3316 
Pro Ser Leu Trp Tyr He He Gly He Gin Phe Leu Leu Leu Trp Leu 
1060 1085 1090 

GTA TCT GGC AGC ACA CAC CGG CTG TTA TGACCTTCTA AAAACCAAAT 3363 
Val Ser Gly Ser Thr His Arg Leu Leu 
1095 1100 

CTGCATAGTT TkAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACG 3423 

TAGGGTCAGC TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC ATAACACTAA 3483 

GGOGCAGACT CCTAAGGCAC CCACTGGCTG CATGTCAGG6 TGTCAGATCC TTAAACGTGT 3543 

GTGAATGCTG CATCATCTAT GT6TAACATC AAA6CAAAAT CCTATACGTQ TCCTCTATTG 3603 



GAAAATTTGG GCGTTTGTTG TTGCATTGTT GGT 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3585 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: WA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 35.-3295 



3636 
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(D) OTHER INFORMATION: /standard_name« **Alpha-2c" 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1. .34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3296. .3585 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GC6GGGGAG6 GGGCATTGAT CTTCGATCGC GAAG AT6 GCT GCT G6C TGC CTG 52 

Met Ala Ala Gly Cys Leu 

1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100 
Leu Ala I«eu Thr Leu Thr Leu Phe Gin Ser Leu Leu He Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GT6 GAT 148 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr He Lys Ser Trp Val Asp 

25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 196 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 244 
Asn Gin Leu Val Asp He Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 292 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu He Ala Ala Arg Asp He 
75 80 85 

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 340 
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 
90 95 100 

GAA 6CG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388 
Glu Ala Glu Lys Val Gin Ala Ala Bis Gin Trp Arg Glu Asp Phe Ala 
105 110 115 

AGC AAT GAA GTT GTC TAG TAG AAT GCA AAG GAT GAT CTC GAT CCT GAG 436 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 484 
Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg He Lys Pro Val Phe He 
135 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532 
Glu Asp Ala Asn Phe Gly Arg Gin He Ser Tyr Gin His Ala Ala Val 
155 160 165 
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CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580 
His He Pro Thr Asp He Tyr Glu Gly Ser Thr He Val Leu Asn Qlu 
170 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT C6C GAG 628 
lieu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 
165 190 195 

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676 
Glu Asp Pro Ser Leu Leu Txp Gin Val Phe Gly Ser Ala Thr Gly Leu 
200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772 
Asn Lys He Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 
235 240 245 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 820 
Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Gly 

250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 916 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA A6C TGT TTT CAG CAC CTT GTC CAA GCA 964 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012 
Asn Val Arg Asn Lys Lys Val I«eu Lys Asp Ala Val Asn Asn He Thr 
315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060 
Ala Lys Gly He Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108 
Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 
360 365 370 

JU^ TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 1204 
Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 
375 380 385 390 
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CaA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 1252 
Gin His Asn Tyr Glu Arg Gly Pro lie Gin Trp Met Ala Cys Glu Asn 

395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300 
Lys Gly Tyr Tyr Tyr Glu lie Pro Ser lie Gly Ala lie Arg lie Asn 
410 415 420 

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 1348 
Thr Gin Glu Tyr Leu Asp Val Leu Gly Arg Pro ^4et Val Leu Ala Gly 
425 430 435 

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT 6TG TAC CTG GAT GCA TTG 1396 
Asp Lys Ala Lys Gin Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu 
440 445 450 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 1444 
Glu Leu Gly Leu Val lie Thr Gly Thr Xieu Pro Val Phe Asn lie Thr 
455 460 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 1492 
Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu lie Leu Gly 
475 480 485 

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT AAA AGA CTG ACA CCA 1540 
Val Met Gly Val Asp Val Ser Leu Glu Asp lie Lys Arg Leu Thr Pro 
490 495 500 

CQT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala lie Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG GAG CCA GTA ACA 1636 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys Glu Pro Val Thr 

520 525 530 

TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT ATT AAA GTG GAG ATT 1684 
Leu Asp Phe Leu Asp Ala Glu I^u Glu Asn Asp lie Lys Val Glu lie 
535 540 545 550 

CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA AAA ACA TTC AGA ACT 1732 
Arg Asn Lys Met lie Asp Gly Glu Ser Gly Glu Lys Thr Phe Arg Thr 
555 560 565 

CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC AAA GGA AAC AGG ACA 1780 
Leu Val Lys Ser Gin Asp Glu Arg Tyr He Asp Lys Gly Asn Arg Thr 
570 575 580 

TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC AGT TTG GCC TTG GTA 1828 
Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr Ser Leu Ala Leu Val 
585 590 595 

TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC AAA CTA GAA GAG ACA 1876 
L&a Pro Thr Tyr Ser Phe Tyr Tyr He Lys Ala Lys Leu Glu Glu Thr 
600 605 610 
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ATA ACT CAG 6CC AGA TCA AAA AAG GGC AAA ATG AAG GAT TCG GAA ACC 1924 
He Thr Gin Ala Arg Ser Lys Lys Gly Lys Met I*ys Asp Ser Glu Thr 
615 620 625 630 

CTG AAG CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA GCA CCA 1972 
Leu Lys Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe He Ala Pro 
635 640 645 

AGA GAT TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA TTT 2020 
Arg Asp Tyr Cys Asn Asp Leu Lys He Ser Asp Asn Asn Thr Glu Phe 

650 655 660 

CTT TTA AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC CCA 2068 
Leu Leu Asn Phe Asn Glu Phe He Asp Arg Lys Thr Pro Asn Asn Pro 
665 670 675 

TCA TGT AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC TTT 2116 
Ser Cys Asn Ala Asp Leu He Asn Arg Val Leu Leu Asp Ala Gly Phe 
680 685 690 

ACA AAT GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG AAA AAT ATC AAG 2164 
Thr Asn Glu Leu Val Gin Asn Tyr Trp Ser Lys Gin Lys Asn He Lys 

695 700 705 710 

GGA GTG AAA GCA CGA TTT GTT GTG ACT GAT GGT GG6 ATT ACC AGA GTT 2212 
Gly Val Lys Ala Arg Phe Val Val Thr Asp Gly Gly He Thr Arg Val 
715 720 725 

TAT CCC AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA TAT 2260 
Tyr Pro Lys Glu Ala Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr Tyr 
730 735 740 

GAG GAC AQC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT TTC 2308 
Glu Asp Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe 
745 750 755 

ACT GCT CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG GGC 2356 
Thr Ala Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly 
760 765 770 

ATT ATG GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT CTT 2404 
He Met Val Ser Lys Ala Val Glu He Tyr He Gin Gly Lys Leu Leu 
775 780 785 790 

AAA CCT GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA GAG 2452 
Lys Pro Ala Val Val Gly He Lys He Asp Val Asn Ser Trp He Glu 
795 8O0 805 

AAT TTC ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT TGT 2500 
Asn Phe Thr Lys Thr Ser He Arg Asp Pro Cys Ala Gly Pro Val Cys 
810 815 820 

GAC TGC AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT GAT 2548 
Asp Cys Lys Arg Asn Ser Asp Val Met Asp Cys Val He Leu Asp Asp 
825 830 835 
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GGT GGG TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT CAG ATT 2596 
Gly Gly Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gin He 
840 845 850 

GGA AGA TTT TTT GGA GAG ATT GAT CCC AGO TTG ATG AGA CAC CTG GTT 2644 
Gly Arg Phe Phe Gly Glu He Asp Pro Ser Leu Met Arg His Leu Val 
855 860 865 870 

AAT ATA TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA GTA 2692 
Asn He Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser Val 

875 880 865 

TGT GAG CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA GCA 2740 
Cys Glu Pro Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arg Ser Ala 
890 895 900 

TAT GTG CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC ACT 2788 
Tyr Val Pro Ser Val Ala Asp He Leu Gin He Gly Trp Trp Ala Thr 
905 910 915 

GCT GCT GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC TTT 2836 
Ala Ala Ala Trp Ser He Leu Gin Gin Phe Leu Leu Ser Leu Thr Phe 
920 925 930 

CCA CGA CTC CTT G7U3 GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG GCC 2884 
Pro Arg Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr Ala 
B25 940 945 950 

TCC CTG TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC TTC 2932 
Ser Leu Ser Lys Gin Ser Cys He Thr Glu Gin Thr Gin Tyr Phe Phe 
955 960 965 

GAT AAC GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC TGT 2980 
Asp Asn Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn Cys 
970 975 980 

TCC AGA ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA TTC 3028 
Ser Arg He Phe His Gly Glu Lye Leu Met Asn Thr Asn Leu He Phe 
985 990 995 

ATA ATG GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG CTC 3076 
He Met Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu Leu 
1000 1005 1010 

ATA CAA GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG GTT 3124 
He Gin Ala Glu Gin Thr Ser Asp Gly Pro Asn Pro Cys Asp Met Val 
1015 1020 1025 1030 

AAG CAA CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC AAT 3172 
Lys Gin Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn Asn 
1035 1040 1045 

GTC TTG GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA AAT CCC 3220 
Val Leu Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn Pro 
1050 1055 1060 

TCC CTG TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG GTA 3268 
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Ser Leu Trp Tyr lie He Gly He Gin Phe Leu Leu lieu Trp Leu Val 
1065 1070 1075 

TCT GGC AGC ACA CAC CGG CTG TTA TGACCTTCTA AAAACX^UAT CTGCATAGTT 3322 
Ser Gly Ser Thr His Arg Leu Leu 
1080 1085 

AAACTCCAGA CCCTGCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACX3 TAGGGTCAGC 3382 

TATAAAATCA GACAAACATT AGCTGGGCCT GTTCCATGGC ATAACACTAA GGC6CAGACT 3442 

CCTAAGGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC TTAAACGTGT 6TGAATGCTG 3505 

CATCATCTAT GTGTAACATC AAAGCAAAAT CCTATACGT6 TCCTCTATT6 GAAAATTTGG 3562 

GCGTTTGTTG TTGCATTGTT GGT 3585 

(2) INFORMATION FOR SBQ ID N0:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: TStlA (gcmomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 35-. 3374 (61625 to 1639 & 61908 to 1928) 
(D) OTHER INFORMATION: /standard_name=: '*Alpha-2d" 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1. .34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3375.. 3565 

(Xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 31: 

GCGGGGGAGG GGGCATTGAT CTTCGATCGC GAAG ATG GCT GCT GGC TGC CTG 52 

Met Ala Ala Gly Cys Leu 

1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100 
Leu Ala Leu Thr Leu Thr Leu Phe Gin Ser Leu Leu He Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 148 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr He Lys Ser Trp Val Asp 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG 6CA AAA ACA 6CA AGT GGA GTC 196 
Lys Met Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 
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40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 244 
Asn Gin Ijeu Val Asp lie Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA AAT AAT GCA CGC CAG CTG GTA GAA ATT GCA GCC AGG GAT ATT 292 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu lie Ala Ala Arg Asp lie 
75 80 85 

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 340 
Glu Lys Leu Leu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 

BO 95 100 

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388 
Glu Ala Glu Lys Val Gin Ala Ala Bis Gin Trp Arg Glu Asp Phe Ala 
105 110 115 

AGC AAT GAA GTT GTC TAC TAC AAT GCA AA6 GAT GAT CTC GAT CCT GAG 436 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA AAA CCT GTT TTC ATT 484 
Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg lie Lys Pro Val Phe lie 
135 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532 
Glu Asp Ala Asn Phe Gly Arg Gin He Ser Tyr Gin His Ala Ala Val 
155 160 165 

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580 
His He Pro Thr Asp He Tyr Glu Gly Ser Thr He Val Leu Asn Glu 
170 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 628 
Leu Asn Trp Thr Ser Ala I.eu Asp Glu Val Phe Lys Lys Asn Arg Glu 
185 190 195 

GAA GAC CCT tCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676 
Glu Asp Pro Ser Leu Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu 
200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772 
Asn Lys He Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 
235 240 245 

GGA GCT GCA TCT CCT AAA GAC ATQ CTT ATT CTG GTG GAT GTG AGT GGA 820 
Gly Ala Ala Ser Pro Lys Asp Met Leu He Leu Val Asp Val Ser Gly 
250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
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265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 916 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC TGT TTT CAG CAC CTT GTC CAA GCA 964 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 

AAT GTA AGA AAT AAA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012 
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn lie Thr 

315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060 
Ala Lys Gly lie Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108 
Gin Leu Leu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys lie lie 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu lie Phe Asn 
360 365 370 



AAA 


TAG 


AAT 


AAA 


GAT 


AAA 


AAA 


GTA 


CGT 


GTA 


TTC 


AGG TTT 


TCA 


GTT GGT 


1204 


Lys 


Tyr 


Asn 


Lys 


Asp 


Lys 


Lys 


Val 


Arg 


Val 


Phe 


Arg Phe 


Ser 


Val Gly 




375 










380 










385 






390 




CAA 


CAC 


AAT 


TAT 


GAG 


AGA 


GGA 


CCT 


ATT 


CAG 


TGG 


ATG GCC 


TGT 


GAA AAC 


1252 


Gin 


His 


Asn 


Tyr 


Glu 


Arg 


Gly 


Pro 


lie 


Gin 


Trp 


Met Ala 


Cys 


Glu Asn 










395 






400 








405 




AAA 


GGT 


TAT 


TAT 


TAT 


GAA 


ATT 


CCT 


TCC 


ATT 


GGT 


GCA ATA 


AGA 


ATC AAT 


1300 


Lys 


Gly 


Tyr 


Tyr 


Tyr 


Glu 


lie 


Pro 


Ser 


lie 


Gly 


Ala lie 


Arg 


He Asn 








410 










415 








420 






ACT 


CAG 


GAA 


TAT 


TTG 


GAT 


GTT 


TTG 


GGA 


AGA 


CCA 


ATG GTT 


TTA 


GCA GGA 


1348 


Thr 


Gin 


Glu 


Tyr 


Leu 


Asp 


val 


Leu 


Gly 


Arg 


Pro 


Net Val 


Leu 


Ala Gly 








425 






430 








435 








GAC 


AAA 


GCT 


AAG 


CAA 


GTC 


CAA 


TGG 


ACA 


AAT 


GTG 


TAC CTG 


GAT 


GCA TTG 


1396 


Asp 


Lys 


Ala 


Lys 


Gin 


Val 


Gin 


Trp 


Thr 


Asn 


val 


Tyr Leu 


Asp 


Ala Leu 




440 










445 










450 








GAA 


CTG 


GGA 


CTT 


GTC 


ATT 


ACT 


GGA 


ACT 


CTT 


CCG 


GTC TTC 


AAC 


ATA ACC 


1444 


Glu 


Leu 


Gly 


Leu 


Val 


lie 


Thr 


Gly 


Thr 


Leu 


Pro 


Val Phe 


Asn 


He Thr 




455 








460 










465 






470 




GGC 


CAA 


TTT 


GAA 


AAT 


AAG 


ACA 


AAC 


TTA 


AAG 


AAC 


CAG CTG 


ATT 


CTT GGT 


1492 


Gly 


Gin 


Phe 


Glu 


Asn 


Lys 


Thr 


Asn 


Leu 


Lys 


Asn 


Gin Leu 


He 


Leu Gly 










475 








480 








465 




GTG 


ATG 


GGA 


GTA 


GAT 


GTG 


TCT 


TTG 


GAA 


GAT 


ATT 


AAA AGA 


CTG 


ACA CCA 


1540 


Val 


Met 


Gly 


Val 


Asp 


Val 


Ser 


Leu 


Glu 


Asp 


He 


Lys Arg 


Leu 


Thr Pro 








490 








495 








500 
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CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala lie Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG GAG CCA GTA ACA 1636 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys Glu Pro Val Thr 

520 525 530 

TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT ATT ATA GTG GAG ATT 1684 
Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp lie Lys Val Glu lie 
535 540 545 550 

CGA AAT AAG AT6 ATT GAT GGG GAA AST GGA QAA AAA ACA TTC AGA ACT 1732 
Arg Asn Lys Met He Asp Gly Glu Ser Gly Glu Lys Thr Phe Arg Thr 
555 560 565 

CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC AAA GGA AAC AGG ACA 1780 
Leu Val Lys Ser Gin Asp Glu Arg Tyr He Asp Lys Gly Asn Arg Thr 
570 575 580 

TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC AGT TTG GCC TTG GTA 1828 
Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr Ser Leu Ala Leu Val 
585 590 595 

TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC AAA CTA GAA GAG ACA 1876 
Leu Pro Thr Tyr Ser Phe Tyr Tyr He Lys Ala Lys Leu Glu Glu Thr 
600 605 610 

ATA ACT CAG GCC AGA TAT TCG GAA ACC CTG AAG CCA GAT AAT TTT GAA 1924 
He Thr Gin Ala Arg Tyr Ser Glu Thr Leu Lys Pro Asp Asn Phe Glu 
615 620 625 630 

GAA TCT GGC TAT ACA TTC ATA GCA CCA AGA GAT TAC TGC AAT GAC CTG 1972 
Glu Ser Gly Tyr Thr Phe He Ala Pro Arg Asp Tyr Cys Asn Asp Leu 
635 640 645 

AAA ATA TCG GAT AAT AAC ACT GAA TTT CTT TTA AAT TTC AAC GAG TTT 2020 
Lys He Ser Asp Asn Asn Thr Glu Phe Ijeu Leu Asn Phe Asn Glu Phe 
650 655 660 

ATT GAT AGA AAA ACT CCA AAC AAC CCA TCA TGT AAC GCG GAT TTG ATT 2068 
He Asp Arg Lys Thr Pro Asn Asn Pro Ser Cys Asn Ala Asp Leu He 
665 670 675 

AAT AGA GTC TTG CTT GAT GCA GGC TTT ACA AAT GAA CTT GTC CAA AAT 2116 
Asn Arg Val Leu lieu Asp Ala Gly Phe Thr Asn Glu Leu Val Gin Asn 
680 685 690 

TAC TGG AGT AAG CAG AAA AAT ATC AAG GGA GTG AAA GCA CGA TTT GTT 2164 
Tyr Trp Ser Lys Gin Lys Asn He Lys Gly Val Lys Ala Arg Phe Val 
695 700 705 710 

GTG ACT GAT GGT GGG ATT ACC AGA GTT TAT CCC AAA GAG GCT GGA GAA 2212 
Val Thr Asp Gly Gly He Thr Arg Val Tyr Pro Lys Glu Ala Gly Glu 
715 720 725 
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AAT TGG CAA GAA AAC CCA GAG ACA TAT GAG GAG AGO TTC TAT AAA AGG 2260 
Asn Trp Gin Glu Asn Pro Glu Thr Tyr Glu Asp Ser Phe Tyr Lys Arg 
730 735 740 

AGO CTA GAT AAT GAT AAC TAT GTT TTC ACT 6CT CCC TAC TTT AAC AAA 2308 
Ser Leu Asp Asn Asp Asn Tyr Val Phe Thr Ala Pro Tyr Phe Asn Lys 
745 750 755 

AGT GGA CCT GGT GCC TAT GAA TCG GGC ATT ATG 6TA AGC AAA 6CT GTA 2356 
Ser Gly Pro Gly Ala Tyr Glu Ser Gly He Net Val Ser Lys Ala Val 
760 765 770 

GAA ATA TAT ATT CAA GGG AAA CTT CTT AAA CCT GCA GTT GTT GGA ATT 2404 
Glu He Tyr He Gin Gly Lys Leu Leu Lys Pro Ala Val Val Gly He 
775 780 785 790 

AAA ATT GAT GTA AAT TCC TGG ATA GAG AAT TTC ACC AAA ACC TCA ATC 2452 
Lys He Asp Val Asn Ser Trp He Glu Asn Phe Thr Lys Thr Ser He 
795 800 805 

AGA GAT CCG TGT GCT GGT CCA GTT TGT GAC TGC AAA AGA AAC AGT GAC 2500 
Arg Asp Pro Cys Ala Gly Pro Val Cys Asp Cys Lys Arg Asn Ser Asp 
810 815 820 

GTA ATG GAT TGT 6T6 ATT CTG GAT GAT GGT GGG TTT CTT CTG ATG GCA 2548 
Val Met Asp Cys Val He Leu Asp Asp Gly Gly Phe Leu Leu Met Ala 
825 830 835 

AAT CAT GAT GAT TAT ACT AAT CAG ATT GGA AGA TTT TTT GGA GAG ATT 2596 
Asn His Asp Asp Tyr Thr Asn Gin He Gly Arg Phe Phe Gly Glu He 
840 845 850 

GAT CCC AGC TTG ATG AGA CAC CTG GTT AAT ATA TCA GTT TAT GCT TTT 2644 
Asp Pro Ser Leu Net Arg His Leu Val Asn He Ser Val Tyr Ala Phe 
855 860 865 870 

AAC AAA TCT TAT GAT TAT CAG TCA GTA TGT GAG CCC GGT GCT GCA CCA 2692 
Asn Lys Ser Tyr Asp Tyr Gin Ser Val Cys Glu Pro Gly Ala Ala Pro 

875 880 885 

AAA CAA GGA GCA GGA CAT CGC TCA GCA TAT GTG CCA TCA GTA GCA GAC 2740 
Lys Gin Gly Ala Gly His Arg Ser Ala Tyr Val Pro Ser Val Ala Asp 
890 895 900 

ATA TTA CAA ATT GGC TGG TGG GCC ACT GCT GCT GCC TGG TCT ATT CTA 2788 
He Leu Gin He Gly Trp Trp Ala Thr Ala Ala Ala Tzp Ser He Leu 
905 910 915 

CAG CAG TTT CTC TTG AGT TTG ACC TTT CCA C6A CTC CTT GAG GCA GTT 2836 
Gin Gin Phe Leu Leu Ser Leu Thr Phe Pro Arg Leu Leu Glu Ala Val 
920 925 930 

GAG ATG GAG GAT GAT GAC TTC ACG GCC TCC CTG TCC AAG CAG AGC TGC 2884 
Glu Met Glu Asp Asp Asp Phe Thr Ala Ser Leu Ser Lys Gin Ser Cys 
935 940 945 950 

ATT ACT GAA CAA ACC CAG TAT TTC TTC GAT AAC GAC AGT AAA TCA TTC 2932 
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Ile Thr 61u Gin Thr Gin Tyr Phe Phe Asp Asn Asp Ser Lys Ser Phe 
955 960 965 

AGT GGT GTA TTA GAC TGT GGA AAC TGT TCC AGA ATC TTT CAT GGA GAA 2980 
Ser Gly Val Leu Asp Cys Gly Asn Cys Ser Arg lie Phe His Gly Glu 
970 975 980 

AAG CTT ATG AAC ACC RAC TTA ATA TTC ATA ATG GTT GAG AGC AAA GGG 3028 
Lys Leu Met Asn Thr Asn Leu lie Phe lie Met Val Glu Ser Lys Gly 
985 990 995 

ACA TGT CCA TGT GAC ACA CGA CTG CTC ATA CAA GCG GAG CAG ACT TCT 3076 
Thr Cys Pro Cys Asp Thr Arg Leu Leu lie Gin Ala Glu Gin Thr Ser 
1000 1005 1010 

GAC GGT CCA AAT CCT TGT GAC ATG GTT AAG CAA CCT AGA TAC CGA AAA 3124 
Asp Gly Pro Asn Pro Cys Asp Met Val Lys Gin Pro Arg Tyr Arg Lys 
1015 1020 i025 1030 

GGG CCT GAT GTC TGC TTT GAT AAC AAT GTC TTG GAG GAT TAT ACT GAC '3172 
Gly Pro Asp Val Cys Phe Asp Asn Asn Val Leu Glu Asp Tyr Thr Asp 
1035 1040 1045 

TGT GGT GGT GTT TCT GGA TTA AAT CCC TCC CTG TGG TAT ATC ATT GGA 3220 
Cys Gly Gly Val Ser Gly Leu Asn Pro Ser Leu Tip Tyr lie lie Gly 
1050 1055 1060 

ATC CAG TTT CTA CTA CTT TGG CTG GTA TCT GGC AGC ACA CAC CGG CTG 3268 
lie Gin Phe Leu Leu Leu Trp Leu Val Ser Gly Ser Thr His Arg Leii 
1065 1070 1075 

TTA TGACCTTCTA AAAACCAAAT CTGCATAGTT AAACTCCA6A CCCTGCCAAA 3321 

heix 

ACATGAGCCC TGCCCTCAAT TACAGTAACG TAGGGTCASC TATAAAATCA GACAAACATT 3381 

AGCTGGGCCT GTTCCATGGC ATAACACTAA G6CGCAGACT CCTAAG6CAC CCACTGGCTG 3441 

CATGTCAGGG TGTCAGATCC TTAAACGTGT GTGAATGCTG CATCATCTAT 6TGTAACATC 3501 

AAA6CAAAAT CCTATACGTG TCCTCTATTG GAAAATTTGG GC6TTTGTTG TTGCATTGTT 3561 

GGT 3564 
(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3579 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECDLB TYPE: DNA (genomic) 



(ix) FEATURE: 
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(A) NAME/KEY: CDS 

(B) LOCATION: 35. .3289 

(D) OTHER INFORMATION: /standarci_nanie- "Alpha-2e" 

(ix) FEATURE: 

(A) NAME/KEY: 5'DTR 

(B) LOCATION: 1..34 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 3289.. 3579 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 15: 

GOGGGGGAGG GGGCATT6AT CTTCGATCGC GAAG ATG GCT OCT GGC TGC CTG 52 

Net Ala Ala Gly Cys Leu 
1 5 

CTG GCC TTG ACT CTG ACA CTT TTC CAA TCT TTG CTC ATC GGC CCC TCG 100 
Leu Ala Leu Thr Leu Tbr Leu Phe Gin Ser Leu Leu lie Gly Pro Ser 
10 15 20 

TCG GAG GAG CCG TTC CCT TCG GCC GTC ACT ATC AAA TCA TGG GTG GAT 148 
Ser Glu Glu Pro Phe Pro Ser Ala Val Thr lie Lys Ser Trp Val Asp 
25 30 35 

AAG ATG CAA GAA GAC CTT GTC ACA CTG GCA AAA ACA GCA AGT GGA GTC 196 
Lys Net Gin Glu Asp Leu Val Thr Leu Ala Lys Thr Ala Ser Gly Val 
40 45 50 

AAT CAG CTT GTT GAT ATT TAT GAG AAA TAT CAA GAT TTG TAT ACT GTG 244 
Asn Gin Leu Val Asp lie Tyr Glu Lys Tyr Gin Asp Leu Tyr Thr Val 
55 60 65 70 

GAA CCA UiT AAT GCA C6C CAG CTG GTA GAA ATT GCA GCC AGG GAT. ATT 292 
Glu Pro Asn Asn Ala Arg Gin Leu Val Glu lie Ala Ala Arg Asp lie 
75 80 85 

GAG AAA CTT CTG AGC AAC AGA TCT AAA GCC CTG GTG AGC CTG GCA TTG 340 
Glu Lys Leu I^eu Ser Asn Arg Ser Lys Ala Leu Val Ser Leu Ala Leu 
90 95 100 

GAA GCG GAG AAA GTT CAA GCA GCT CAC CAG TGG AGA GAA GAT TTT GCA 388 
Glu Ala Glu Lys Val Gin Ala Ala His Gin Trp Arg Glu Asp Phe Ala 
105 110 115 

AGC AAT GAA GTT GTC TAC TAC AAT GCA AAG GAT GAT CTC GAT CCT GAG 436 
Ser Asn Glu Val Val Tyr Tyr Asn Ala Lys Asp Asp Leu Asp Pro Glu 
120 125 130 

AAA AAT GAC AGT GAG CCA GGC AGC CAG AGG ATA MiA CCT GTT TTC ATT 484 
Lys Asn Asp Ser Glu Pro Gly Ser Gin Arg He Lys Pro Val Phe He 
135 140 145 150 

GAA GAT GCT AAT TTT GGA CGA CAA ATA TCT TAT CAG CAC GCA GCA GTC 532 
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Glu Asp Ala Asn Phe Gly Arg Gin lie Ser Tyr Gin His Ala Ala Val 
155 160 165 

CAT ATT CCT ACT GAC ATC TAT GAG GGC TCA ACA ATT GTG TTA AAT GAA 580 
His He Pro Thr Asp He Tyr Glu Gly Ser Thr He Val Leu Asn Glu 
170 175 180 

CTC AAC TGG ACA AGT GCC TTA GAT GAA GTT TTC AAA AAG AAT CGC GAG 628 
Leu Asn Trp Thr Ser Ala Leu Asp Glu Val Phe Lys Lys Asn Arg Glu 
185 190 195 

GAA GAC CCT TCA TTA TTG TGG CAG GTT TTT GGC AGT GCC ACT GGC CTA 676 
Glu Asp Pro Ser Leu Leu Trp Gin Val Phe Gly Ser Ala Thr Gly Leu 
200 205 210 

GCT CGA TAT TAT CCA GCT TCA CCA TGG GTT GAT AAT AGT AGA ACT CCA 724 
Ala Arg Tyr Tyr Pro Ala Ser Pro Trp Val Asp Asn Ser Arg Thr Pro 
215 220 225 230 

AAT AAG ATT GAC CTT TAT GAT GTA CGC AGA AGA CCA TGG TAC ATC CAA 772 
Asn Lys He Asp Leu Tyr Asp Val Arg Arg Arg Pro Trp Tyr He Gin 
235 240 245 

GGA GCT GCA TCT CCT AAA GAC ATG CTT ATT CTG GTG GAT GTG AGT GGA 620 
Gly Ala Ala Ser Pro Lys T^p Met Leu He Leu Val Asp Val Ser Gly 
250 255 260 

AGT GTT AGT GGA TTG ACA CTT AAA CTG ATC CGA ACA TCT GTC TCC GAA 868 
Ser Val Ser Gly Leu Thr Leu Lys Leu He Arg Thr Ser Val Ser Glu 
265 270 275 

ATG TTA GAA ACC CTC TCA GAT GAT GAT TTC GTG AAT GTA GCT TCA TTT 916 
Met Leu Glu Thr Leu Ser Asp Asp Asp Phe Val Asn Val Ala Ser Phe 
280 285 290 

AAC AGC AAT GCT CAG GAT GTA AGC T6T TTT CAG CAC CTT GTC CAA GCA 964 
Asn Ser Asn Ala Gin Asp Val Ser Cys Phe Gin His Leu Val Gin Ala 
295 300 305 310 

AAT GTA AGA AAT MA AAA GTG TTG AAA GAC GCG GTG AAT AAT ATC ACA 1012 
Asn Val Arg Asn Lys Lys Val Leu Lys Asp Ala Val Asn Asn He Thr 
315 320 325 

GCC AAA GGA ATT ACA GAT TAT AAG AAG GGC TTT AGT TTT GCT TTT GAA 1060 
Ala Lys Gly He Thr Asp Tyr Lys Lys Gly Phe Ser Phe Ala Phe Glu 
330 335 340 

CAG CTG CTT AAT TAT AAT GTT TCC AGA GCA AAC TGC AAT AAG ATT ATT 1108 
Gin Leu lieu Asn Tyr Asn Val Ser Arg Ala Asn Cys Asn Lys He He 
345 350 355 

ATG CTA TTC ACG GAT GGA GGA GAA GAG AGA GCC CAG GAG ATA TTT AAC 1156 
Met Leu Phe Thr Asp Gly Gly Glu Glu Arg Ala Gin Glu He Phe Asn 
360 365 370 

AAA TAC AAT AAA GAT AAA AAA GTA CGT GTA TTC AGG TTT TCA GTT GGT 1204 
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Lys Tyr Asn Lys Asp Lys Lys Val Arg Val Phe Arg Phe Ser Val Gly 
375 380 385 390 

CAA CAC AAT TAT GAG AGA GGA CCT ATT CAG TGG ATG GCC TGT GAA AAC 1252 
Gin His Asn Tyr Glu Arg Gly Pro lie Gin Trp Met Ala Cys Glu Asn 
395 400 405 

AAA GGT TAT TAT TAT GAA ATT CCT TCC ATT GGT GCA ATA AGA ATC AAT 1300 
Lys Gly Tyr Tyr Tyr Glu lie Pro Ser lie Gly Ala lie Arg He Asn 
410 415 420 

ACT CAG GAA TAT TTG GAT GTT TTG GGA AGA CCA ATG GTT TTA GCA GGA 1348 
Thr Gin Glu Tyr X.eu Asp Val Leu Gly Arg Pro Met Val Leu Ala Gly 
425 430 435 

GAC AAA GCT AAG CAA GTC CAA TGG ACA AAT GTG TAC CTG GAT GCA TTG 1396 
Asp Lys Ala Lys Gin Val Gin Trp Thr Asn Val Tyr Leu Asp Ala Leu 
440 445 450 

GAA CTG GGA CTT GTC ATT ACT GGA ACT CTT CCG GTC TTC AAC ATA ACC 1444 
Glu Leu Gly Leu Val He Thr Gly Thr Leu Pro Val Phe Asn He Thr 
455 460 465 470 

GGC CAA TTT GAA AAT AAG ACA AAC TTA AAG AAC CAG CTG ATT CTT GGT 1492 
Gly Gin Phe Glu Asn Lys Thr Asn Leu Lys Asn Gin Leu He Leu Gly 
475 480 485 

GTG ATG GGA GTA GAT GTG TCT TTG GAA GAT ATT ATIA AGA CTG ACA CCA 1540 
Val Met Gly Val Asp Val Ser Ijeu Glu Asp He Lys Arg Leu Thr Pro 
490 495 500 

CGT TTT ACA CTG TGC CCC AAT GGG TAT TAC TTT GCA ATC GAT CCT AAT 1588 
Arg Phe Thr Leu Cys Pro Asn Gly Tyr Tyr Phe Ala He Asp Pro Asn 
505 510 515 

GGT TAT GTT TTA TTA CAT CCA AAT CTT CAG CCA AAG AAC CCC AAA TCT 1636 
Gly Tyr Val Leu Leu His Pro Asn Leu Gin Pro Lys Asn Pro Lys Ser 
520 525 530 

CAG GAG CCA GTA ACA TTG GAT TTC CTT GAT GCA GAG TTA GAG AAT GAT 1684 
Gin Glu Pro Val Thr Leu Asp Phe Leu Asp Ala Glu Leu Glu Asn Asp 
535 540 545 550 

ATT AAA GTG GAG ATT CGA AAT AAG ATG ATT GAT GGG GAA AGT GGA GAA 1732 
He Lys Val Glu He Arg Asn Lys Met He Asp Gly Glu Ser Gly Glu 
555 560 565 

AAA ACA TTC AGA ACT CTG GTT AAA TCT CAA GAT GAG AGA TAT ATT GAC 1780 
Lys Thr Phe Arg Thr Leu Val Lys Ser Gin Asp Glu Arg Tyr He Asp 
570 575 580 

AAA GGA AAC AGG ACA TAC ACA TGG ACA CCT GTC AAT GGC ACA GAT TAC 1828 
Lys Gly Asn Arg Thr Tyr Thr Trp Thr Pro Val Asn Gly Thr Asp Tyr 
585 590 595 

AGT TTG GCC TTG GTA TTA CCA ACC TAC AGT TTT TAC TAT ATA AAA GCC 1876 
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Ser Leu Ala Leu Val Leu Pro Thr Tyr Ser Phe Tyr Tyr He Lys Ala 
600 605 610 

AAA CTA GAA GAO ACA ATA ACT CAG GCC AGA TAT TCG GAA ACC CTG AAG 1924 
Lys Leu Glu Glu Thr He Thr Gin Ala Arg Tyr Ser Glu Thr Leu Lys 
615 620 625 630 

CCA GAT AAT TTT GAA GAA TCT GGC TAT ACA TTC ATA 6CA CCA AGA GAT 1972 
Pro Asp Asn Phe Glu Glu Ser Gly Tyr Thr Phe He Ala Pro Arg Asp 
635 640 645 

TAC TGC AAT GAC CTG AAA ATA TCG GAT AAT AAC ACT GAA TTT CTT TTA 2020 
Tyr Cys Aan Asp Leu Lys He Ser Asp Asn Asn Thr Glu Phe Leu Leu 
650 655 660 

AAT TTC AAC GAG TTT ATT GAT AGA AAA ACT CCA AAC AAC CCA TCA TGT 2068 
Asn Phe Asn Glu Phe He Asp Arg Lys Thr Pro Asn Asn Pro Ser Cys 
665 670 675 

AAC GCG GAT TTG ATT AAT AGA GTC TTG CTT GAT GCA GGC TTT ACA AAT 2116 
Asn Ala Asp Leu He Asn Arg Val Leu Leu Asp Ala Gly Phe Thr Asn 
680 685 690 

GAA CTT GTC CAA AAT TAC TGG AGT AAG CAG A7^ AAT ATC AAG GGA GTG 2164 
Glu Leu Val Gin Asn Tyr Tip Ser Lys Gin Lys Asn He Lys Gly Val 
695 700 705 710 

AAA GCA CGA TTT GTT GTG ACT GAT GGT GGG ATT ACC AGA GTT TAT CCC 2212 
Lys Ala Arg Phe Val Val Thr Asp Gly Gly He Thr Arg Val Tyr Pro 
715 720 725 

AAA GAG GCT GGA GAA AAT TGG CAA GAA AAC CCA GAG ACA TAT GAG GAC 2260 
Lys Glu Ala Gly Glu Asn Trp Gin Glu Asn Pro Glu Thr Tyr Glu Asp 
730 735 740 

AGC TTC TAT AAA AGG AGC CTA GAT AAT GAT AAC TAT GTT TTC ACT GCT 2308 
Ser Phe Tyr Lys Arg Ser Leu Asp Asn Asp Asn Tyr Val Phe Thr Ala 
745 750 755 

CCC TAC TTT AAC AAA AGT GGA CCT GGT GCC TAT GAA TCG GGC ATT AT6 2356 
Pro Tyr Phe Asn Lys Ser Gly Pro Gly Ala Tyr Glu Ser Gly He Met 
760 765 770 

GTA AGC AAA GCT GTA GAA ATA TAT ATT CAA GGG AAA CTT CTT AAA CCT 2404 
Val Ser Lys Ala Val Glu He Tyr He Gin Gly Lys Leu Leu Lys Pro 
775 780 785 790 

GCA GTT GTT GGA ATT AAA ATT GAT GTA AAT TCC TGG ATA GAG AAT TTC 2452 
Ala Val Val Gly He Lys He Asp Val Asn Ser Trp He Glu Asn Phe 
795 800 805 

ACC AAA ACC TCA ATC AGA GAT CCG TGT GCT GGT CCA GTT TGT GAC TGC 2500 
Thr Lys Thr Ser He Arg Asp Pro Cys Ala Gly Pro Val Cys Asp Cys 
810 815 820 

AAA AGA AAC AGT GAC GTA ATG GAT TGT GTG ATT CTG GAT GAT GGT GGG 2548 
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Lys Arg Asn Ser Asp Val Met Asp Cys Val lie Leu Asp Asp Gly Gly 
825 830 835 

TTT CTT CTG ATG GCA AAT CAT GAT GAT TAT ACT AAT GAG ATT GGA AGA 2596 
Phe Leu Leu Met Ala Asn His Asp Asp Tyr Thr Asn Gin lie Gly Arg 
840 845 850 

TTT TTT GGA GAG ATT GAT CCC AGO TTG ATG AGA CAC CTG GTT AAT ATA 2644 
Phe Phe Gly Glu lie Asp Pro Ser Leu Met Arg His Leu Val Asn He 
855 860 865 870 

TCA GTT TAT GCT TTT AAC AAA TCT TAT GAT TAT CAG TCA GTA TGT GAG 2692 
Ser Val Tyr Ala Phe Asn Lys Ser Tyr Asp Tyr Gin Ser Val Cys Glu 
875 880 885 

CCC GGT GCT GCA CCA AAA CAA GGA GCA GGA CAT CGC TCA GCA TAT GTG 2740 
Pro Gly Ala Ala Pro Lys Gin Gly Ala Gly His Arg Ser Ala Tyr Val 
890 895 900 

CCA TCA GTA GCA GAC ATA TTA CAA ATT GGC TGG TGG GCC ACT GCT GCT 2788 
Pro Ser Val Ala Asp He Leu Gin He Gly Trp Trp Ala Thr Ala Ala 
905 910 915 

GCC TGG TCT ATT CTA CAG CAG TTT CTC TTG AGT TTG ACC TTT CCA CGA 2836 
Ala Trp Ser He Leu Gin Gin Phe Leu Leu Ser Leu Thr Phe Pro Arg 
920 925 930 

CTC CTT GAG GCA GTT GAG ATG GAG GAT GAT GAC TTC ACG GCC TCC CTG 2884 
Leu Leu Glu Ala Val Glu Met Glu Asp Asp Asp Phe Thr Ala Ser Leu 
935 940 945 950 

TCC AAG CAG AGC TGC ATT ACT GAA CAA ACC CAG TAT TTC TTC GAT AAC 2932 
Ser Lys Gin Ser Cys He Thr Glu Gin Thr Gin Tyr Phe Phe Asp Asn 
955 960 965 

GAC AGT AAA TCA TTC AGT GGT GTA TTA GAC TGT GGA AAC TGT TCC AGA 2980 
Asp Ser Lys Ser Phe Ser Gly Val Leu Asp Cys Gly Asn Cys Ser Arg 
970 975 980 

ATC TTT CAT GGA GAA AAG CTT ATG AAC ACC AAC TTA ATA TTC ATA ATG 3028 
He Phe His Gly Glu Lys Leu Met Asn Thr Asn Leu He Phe He Met 
985 990 995 

GTT GAG AGC AAA GGG ACA TGT CCA TGT GAC ACA CGA CTG CTC ATA CAA 3076 
Val Glu Ser Lys Gly Thr Cys Pro Cys Asp Thr Arg Leu Leu He Gin 
1000 1005 1010 

GCG GAG CAG ACT TCT GAC GGT CCA AAT CCT TGT GAC ATG GTT AAG CAA 3124 
Ala Glu Gin Thr Ser Asp Gly Pro Asn Pro Cys Asp Met Val Lys Gin 
1015 1020 1025 1030 

CCT AGA TAC CGA AAA GGG CCT GAT GTC TGC TTT GAT AAC AAT GTC TTG 3172 
Pro Arg Tyr Arg Lys Gly Pro Asp Val Cys Phe Asp Asn Asn Val Leu 
1035 1040 1045 

GAG GAT TAT ACT GAC TGT GGT GGT GTT TCT GGA TTA J\AT CCC TCC CTG 3220 



-253- 



Glu Asp Tyr Thr Asp Cys Gly Gly Val Ser Gly Leu Asn Pro Ser Leu 
1050 1055 1060 

TGG TAT ATC ATT GGA ATC CAG TTT CTA CTA CTT TGG CTG GTA TCT GGC 3268 
Trp Tyr He He Gly He Gin Phe Leu Leu Leu Trp Leu Val Ser Gly 
1065 1070 1075 

AGC ACA CAC CGG CTG TTA TGACCTTCTA AAAACCAAAT CTGCATAGTT 3316 
Ser Thr His Arg Leu Leu 

1080 108 

AAACTCCaUSA CCCTOCCAAA ACATGAGCCC TGCCCTCAAT TACAGTAACQ TAGGGTCAGC 3376 

TATAAAATCA GACAAACATT AGCTGQGCCT GTTCCATGGC ATAACACTAA GGCGCAQACT 3436 

CCTAAOGCAC CCACTGGCTG CATGTCAGGG TGTCAGATCC TTAAACGTQT GTGAATGCTC 3496 

CATCATCTAT OTGTAACATC AAAGCAAAAT CCTATACGTG TCCTCTATTG GAAAATTTGG 3556 

GCGTTTGTTG TTGCATTGTT GOT 3579 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1681 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCAT ION: 1..1437 

(D) OTHER INFORMATION: /standard_naiiie= "Beta-1-1" 

(ix) FEATURE : 

(A) NAME/KEY: 3'DTR 

(B) LOCATIW: 1435.. 1681 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAG CCA CCC TCC CAG 48 
Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gin 
15 10 15 

GAG ATC CCC ATG GAG (3TC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 96 
Glu He Pro Met Glu Val Phe Asp Pro Ser Pro Gin Gly Lys Tyr Ser 
20 25 30 

AAG AGG AAA GGQ 0C3A TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 144 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 
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ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCX3 GAG TCC TAG ACC 192 
Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT OTA TCT CTG GAG GAG GAC CGG GAA GCC 240 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 80 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 288 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 336 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
XOO 105 110 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala He Thr Phe 
115 120 125 

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432 
Glu Pro Lys Asp Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Trp 
130 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480 
Trp He Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 

150 155 160 

AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAG AAG CTG 528 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
165 170 175 

CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gin Asn Arg lieu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
ISO 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GGT AAT GAA ATG ACT AAC TTA GCC TTT GAA CTA GAC CCC CTA GAG 672 
Ser Gly Asn Glu Met Thr Asn Leu Ala Phe Glu Leu Asp Pro Leu Glu 
210 215 220 

TTA GAG GAG GAA GAG GCT GAG CTT GGT GAG CAG AGT GGC TCT GCC AAG 720 
Leu Glu Glu Glu Glu Ala Glu Leu Gly Glu Gin Ser Gly Ser Ala Lys 
225 230 235 240 

ACT AGT GTT AGC AGT GTC ACC ACC CCG CCA CCC CAT GGC AAA CGC ATC 768 
Thr Ser Val Ser Ser Val Thr Thr Pro Pro Pro His Gly Lys Arg He 
245 250 255 

CCC TTC TTT AAG AAG ACA GAG CAT GTG CCC CCC TAT GAC GTG GTG CCT 816 
Pro Phe Phe Lys Lys Thr Glu His Val Pro Pro Tyr Asp Val Val Pro 
260 265 270 
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TCC ATG AG6 CCC ATC ATC CTG GTG GGA CCG TCG CTC AAG GGC TAG GAG 864 
Ser Met Arg Pro He He Leu Val Gly Pro Ser Leu Lys Gly Tyr Glu 
275 280 285 

GTT ACA GAG ATG ATG GAG AAA GCT TTA TTT GAG TTG TTG AAG GAT CGG 912 
Val Thr Asp Met Met Gin Lys Ala Leu Phe Asp Phe Leu Lys His Arg 
290 295 300 

TTT GAT GGG AGG ATG TCC ATC ACT CGT GTG AC6 GGA GAT ATT TCC CTG 960 
Phe Asp Gly Arg He Ser He Thr Arg Val Thr Ala Asp He Ser Leu 
305 310 315 320 

GCT AAG CGG TCA GTT GTG AAG AAG CCC A6C AAA CAC ATG ATG ATT GAG 1008 
Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His He He He Glu 

325 330 335 

CGG TCC AAG ACA CGG TCC AGG GTG GCT GAG GTG CAG AGT GAA ATC GAG 1056 
Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gin Ser Glu He Glu 
340 345 350 

GGA ATC TTG GAG CTG GGC CGG AGG GTT CAG TTG GTG GCT GTG GAT GCT 1104 
Arg He Phe Glu Leu Ala Arg Thr Leu Gin Leu Val Ala Leu Asp Ala 
355 360 365 

GAG AGG ATG AAT CAC CCA GGC CAG CTG TCC AAG AGG TCG GTG GGC CCC 1152 
Asp Thr He Asn His Pro Ala Gin Leu Ser Lys Thr Ser Leu Ala Pro 
370 375 380 

ATC ATT GTT TAG ATG AAG ATC ACC TCT CCC AAG GTA GTT CAA AGG CTG 1200 
He He Val Tyr He Lys He Thr Ser Pro Lys Val Leu Gin Arg Leu 
385 390 395 400 

ATC AAG TCC GGA GGA AAG TCT CAG TCG AAA CAC CTG AAT GTG CAA ATA 1248 
He Lys Ser Arg Gly Lys Ser Gin Ser Lys His Leu Asn Val Gin He 
405 410 415 

GCG GGG TCG GAA AAG CTG GCA CAG TGC CCC CGT GAA ATG TTT GAC ATG 1296 
Ala Ala Ser Glu Lys Leu Ala Gin Cys Pro Pro Glu Met Phe Asp He 
420 425 430 

ATC CTG GAT GAG AAC CAA TTG GAG GAT GCC TGC GAG CAT GTG GCG GAG 1344 
He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His Leu Ala Glu 
435 440 445 

TAC TTG GAA GCC TAT TGG AAG GCC ACA CAC CCG CCC AGG AGC AGG CCA 1392 
Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser Ser Thr Pro 
450 455 460 

CCC AAT CCG CTG CTG AAC CGG ACC ATG GCT ACC GCA GCG GTG GCT 1437 
Pro Asn Pro Leu Leu Asn Arg Thr Met Ala Thr Ala Ala Leu Ala 
465 470 475 

GCCAGCCCTG CCCCTGTCTC CAACCTCCAG GTACAGGTGC TCAGCTCGCT CAGGAGAAAC 1497 
CTCGGCTTCT GGGGCGGGCT GGAGTCCTCA CAGCGGGGCA GTGTGGTGGG CCAGGAGCAG 1557 
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QAACATGCCA TGTAGTGGGC GCCCTGCCCG TCTTCCCTCC TGCTCTGGGG TCGGAACTGG 
AGTGCAGGGA ACATGGAGGA GGAAGGGAAG AGCTTTATTT TGTAAAAAAA TAAGATGAGC 
GGCA 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCAT ION: 1..651 

(D) OTHER INFORMATION: /standard_name« "Beta-1-4" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATG GTC CAG AAG ACC AGC ATG TCC CGG GGC CCT TAC CCA CCC TCC CAG 
Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gin 
1 5 10 15 

GAG ATC CCC ATG GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 
Glu He Pro Met Glu Val Phe Asp Pro Ser Pro Gin Gly Lys Tyr Ser 
20 25 30 

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACG TCC TCG GAT 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GC6 GAG TCC TAC ACC 
Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser Tyr Thr 
50 55 60 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
65 70 75 80 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
3.00 105 110 

CCG TCT CCA OGG GAT GAG GTG CCT GTG CAG 6GA GTG GCC ATC ACC TTC 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala He Thr Phe 
115 120 125 



'WO vS/tt4X77 
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Su p™ ^ ^ x™* ^ *^ ®^ AAA TAG AAT AAT GAG TGG 

Glu Pro Lys Asp Phe Leu His lie hya Glu Lys Tyr Asn Asn Asp ^ 

135 140 

T?'^ ^ P° ®™ <»G GGC TGT GAG GTT GGC TTC ATT CCC 

Tjp He Gly Arg Leu val Lys Glu Gly Cys Glu val Gly Phe lie 

150 155 j^gQ 

S^Sf^rT^^^^^^^^^CTGO^GAACAGAAGCTG 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
1*5 170 j^^g 

CQC CAG AAC CGC CTC GGC TCC AGG AAA TCA QGC GAT AAG TCC AflT Tcr 
Arg Gin Asn Arg Leu Gly Ser Ser Lys Ser S 2^ Sr Sr 

180 185 190 

AST CTG 6GA GAT GTQ GTQ ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 
ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro tS ^ 

200 205 

^ S ?S ^ 2^ 2J ^^'"^^^ ccttccatga ggcccatcat 

210 215 
CCTGGTGGGA CCGTCGCTCA AGGGCTACGA GGTTACAGAC ATGATGCAGA AAGCTTTATT 
TGACTTCTTG AAGCATCGGT TTGATGGCAG GATCTCCATC ACTCGTGTGA CGGCAGATAT 
TTCCCTGGCT AAOCGCTCAG TTCTCAACAA CCCCAGCAAA CACATCATCA TTGAGCGCTC 
CAACACACGC TCCAGCCTGG CTGAGGTGCA GAGTGAAATC GAGCGAATCT TCGAGCTGGC 
CCGGACCCTT CAGTTGGTCG CTCTGGATGC TGACACCATC AATCACCCAG CCCAGCTGTC 
CAAGACCTCG CTGGCCCCCA TCATTGTTTA CATCAAGATC ACCTCTCCCA AGGTACTTCA 
AAGGCTCATC AAGTCCCX3AG GAAAOTCTCA 6TCCAAACAC CTCAATGTCC AAATAGCGGC 
CTCGGAAAAG CTGGCACAGT GCCCCCCTGA AATGTTTGAC ATCATCCTGG ATGAGAACCA 
ATTGGAGGAT GCCTGCGAGC ATCTGGCGGA GTACTTGGAA GCCTATTGGA AOGCCACACA 
CCCGCCCAGC AGCACGCCAC CCAATCCGCT GCTGAACCGC ACCATGGCTA CCQCAGCCCT 
GGCTGCCAGC CCTGCCCCTG TCTCCAACCT CCAGGTACAG GTGCTCACCT CGCTCAGQAG 
AAACCTCGGC TTCTGGGGCG GGCTGGAGTC CTCACAGCGG GGCAGTGTGG TGCCCCAGGA 
GCAGGAACAT GCCATGTAGT GGGCGCCCTG CCC6TCTTCC CTCCTGCTCT GGGGTCGGAA 
CTGGAGTGCA GGGAACATQG AGGAGGAAGG GAAGAGCTTT ATTTTGTAAA AAAATAAGAT 
GAGCGGCA 

(2) INFOSMATION FOR SSQ ID NO: 35: 



432 

480 

528 

576 

624 

678 

738 
798 
858 
918 
978 

1038 

1098 

1158 

1218 

1278 

1338 

1398 

1458 

1518 

1526 
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(i) SEQXIENCE CHARACTERISTICS: 

(A) LENGTH: 1393 base pairs 
<B) TYPE: nucleic acid 

(C) STRAMDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: VtOi (genomic) 



48 



96 



(ix) FSATDRE: 

(A) NAME/KEY: CDS 

(B) L OCAT ION: 1. .660 

(D) OTHER INFORMATION: /standard_^name= -Beta-l-5" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

JI? vl? ^ CCT TAC CCA CCC TCC CAG 

Met Val Gin Lys Thr Ser Met Ser Arg Gly Pro Tyr Pro Pro Ser Gin 
^ 10 15 

GAG ATC CCC AT6 GAG GTC TTC GAC CCC AGC CCG CAG GGC AAA TAC AGC 
Glu He Pro Met Glu Val Phe Asp Pro Ser Pro Gin Gly Lys ^ Ser 
20 25 30 

AAG AGG AAA GGG CGA TTC AAA CGG TCA GAT GGG AGC ACQ TCC TCG GAT 
Lys Arg Lys Gly Arg Phe Lys Arg Ser Asp Gly Ser Thr Ser Ser Asp 
35 40 45 ^ 

ACC ACA TCC AAC AGC TTT GTC CGC CAG GGC TCA GCG GAG TCC TAC ACC 
Thr Thr Ser Asn Ser Phe Val Arg Gin Gly Ser Ala Glu Ser 
50 55 €0 

AGC CGT CCA TCA GAC TCT GAT GTA TCT CTG GAG GAG GAC CGG GAA GCC 
Ser Arg Pro Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala 
®^ 70 75 gQ 

TTA AGG AAG GAA GCA GAG CGC CAG GCA TTA GCG CAG CTC GAG AAG GCC 
Leu Arg Lys Glu Ala Glu Arg Gin Ala Leu Ala Gin Leu Glu Lys Ala 
85 90 95 

AAG ACC AAG CCA GTG GCA TTT GCT GTG CGG ACA AAT GTT GGC TAC AAT 
Lys Thr Lys Pro Val Ala Phe Ala Val Arg Thr Asn Val Gly Tyr Asn 
100 105 110 

CCG TCT CCA GGG GAT GAG GTG CCT GTG CAG GGA GTG GCC ATC ACC TTC 384 
Pro Ser Pro Gly Asp Glu Val Pro Val Gin Gly Val Ala He Thr Phe 
lis 120 125 

GAG CCC AAA GAC TTC CTG CAC ATC AAG GAG AAA TAC AAT AAT GAC TGG 432 
Glu Pro Lys Asp Phe Leu His He Lys Glu Lys Tyr Asn Asn Asp Tm 
"0 135 140 

TGG ATC GGG CGG CTG GTG AAG GAG GGC TGT GAG GTT GGC TTC ATT CCC 480 
Trp He Gly Arg Leu Val Lys Glu Gly Cys Glu Val Gly Phe He Pro 

150 155 160 



144 



192 



240 



288 
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AGC CCC GTC AAA CTG GAC AGC CTT CGC CTG CTG CAG GAA CAQ AAG CTG 
Ser Pro Val Lys Leu Asp Ser Leu Arg Leu Leu Gin Glu Gin Lys Leu 
165 170 175 



528 



CGC CAG AAC CGC CTC GGC TCC AGC AAA TCA GGC GAT AAC TCC AGT TCC 576 
Arg Gin Asn Arg Leu Gly Ser Ser Lys Ser Gly Asp Asn Ser Ser Ser 
180 185 190 

AGT CTG GGA GAT GTG GTG ACT GGC ACC CGC CGC CCC ACA CCC CCT GCC 624 
Ser Leu Gly Asp Val Val Thr Gly Thr Arg Arg Pro Thr Pro Pro Ala 
195 200 205 

AGT GGT TAG AGA CAT GAT GCA GAA AGC TTT ATT TGACTTCTTG AAGCATCGGT 677 
Ser Gly Tyr Arg His Asp Ala Glu Ser Phe lie 

210 215 220 

TTGATGGCAG GATCTCCATC ACTCGTGTGA CGGCAGATAT TTCCCTGGCT 7AGC6CTCAG 737 

TTCTCAACAA CCCCAGCAAA CACATCATCA TTGAGCGCTC CAACACACGC TCCAGCCTGG 797 

CTGAGGTGCA GAGTGAAATC GAGCGAATCT TCGAGCTGGC CCGGACCCTT CAGTTGGTCG 857 

CTCTGGATGC TGACACCATC AATCACCCAG CCCAGCTGTC CAAGACCTCG CTGGCCCCCA 917 

TCATTGTTTA CATCAAGATC ACCTCTCCCA AGGTACTTCA AAGGCTCATC AAGTCCCGAG 977 

GAAAGTCTCA GTCCAAACAC CTCAATGTCC AAATAGCGGC CTCGQAAAAG CTGGCACAGT 1037 

GCCCCCCTGA AATGTTTGAC ATCATCCTGG ATGAGAACCA ATTGGAGGAT GCCTGC6AGC 1097 

ATCTGGCGGA GTACTTGGAA GCCTATTGGA AGGCCACACA CCCGCCCAOC AGCACGCCAC 1157 

CCAATCCGCT GCTGAACCGC ACCATGGCTA CCGCAGCCCT GGCTGCCAGC CCTGCCCCTG 1217 

TCTCCAACCT CCAGGTACAG GTGCTCACCT CGCTCAGGA6 AAACCTCGGC TTCTGGGGCG 1277 

GGCTGGAGTC CTCACAGCGG GGCAGTGTGG TGCCCCAGGA GCAGGAACAT GCCATCTAGT 1337 

GGGCGCCCTG CCCGTCTTCC CTCCTGCTCT GGGGTCGGAA CTGGAGTGCA GGGAAC 1393 
(2) INFORMATION FOR S5Q ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 6725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLBCDLE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) L OCAT ION; 226.. 6642 

(D) OTHER INFORMATION: /standard_names »Alpha-lC-2» 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CTCGAGGAGG CAGTAGTGGA AAGGAGCAGT TTTTGGGGTT TCATGCCATA ATGGGAATCA 60 

GGTAATCGTC GGCGGGGAAG AAGAAACGCT GCAGACCACG GCTTCCTCGA ATCTTGCGCG 120 

AAAGCCGCCG GCCTCGGAOG AGGGATTAAT CCAGACCCGC CGGGGGGTGT TTTCACATTT 180 

CTTCCTCTTC GTGGCTGCTC CTCCTATTAA AACCATTTTT GGTCC AT6 GTC AAT 234 

Met Val Asn 
1 



50 



?^ ^ ®^ GCT AAG CTG ATG GGC AGC 

Trp Gin Ala Ala He Asp Ala Ala Arg Gin Ala Lys Leu Met Gly Ser 
55 60 



65 



^ m 5 ff^ ^ TCC ACG CAG CGG AAG CGG 

Ala Gly Asn Ala Thr He Ser Thr Val Ser Ser Thr Gin Arg Lys A^ 



80 



282 



^ ^ ^ ATT CCA GAG GAA AAC CAC CAA GGT TCC AAC 

Glu Asn Thr Arg Met Tyr He Pro Glu Glu Asn His Gin Gly Ser Asn 
^ 10 15 

5^ ^ SfS ?^ ATG AAT GCC AAT GCG GCA 330 

Tyr Gly Ser Pro Arg Pro Ala His Ala Asn Met Asn Ala Asn Ala Ala 

25 30 35 

GOT GTC CTG OCC CCT GAG CAC ATC CCC ACC CCG GGG GCT GCC CTG TCG 
Ala Gly Leu Ala Pro ^lu His He Pro Thr Pro Gly Ala Ala Leu Ser 
*0 45 



378 



426 



474 



522 



618 



CAA TAT GGG AAA CCC AAG AAG CAG GGC AGC ACC ACG GCC ACA CGC 
Gin Gin Tyr Gly Lys Pro Lys Lys Gin Gly Ser Thr Thr Ala Thr ^ 
85 90 95 ^ 

^ S^o ^ P"^ ^ CCC ATC CGG AGG 570 

Pro Pro Arg Ala Leu Leu Cys Leu Thr Leu Lys Asn Pro He Arg Arg 
"° 105 110 

GCC TOC ATC AGC ATT GTC GAA TGG AAA CCA TTT GAA ATA ATT ATT TTA 
Ala Cys He Ser He Val Glu Trp Lys Pro Phe Glu He He He Leu 

120 125 130 

CTG ACT ATT TTT GCC AAT TGT GTG GCC TTA GCG ATC TAT ATT CCC TTT 666 
Leu Thr He Phe Ala Asn Cys Val Ala Leu Ala He Tyr He Pro Phe 
135 140 145 

CCA GAA GAT GAT TCC AAC GCC ACC AAT TCC AAC CTG GAA CGA GTG GAA 714 
Pro Glu Asp Asp Ser Asn Ala Thr Asn Ser Asn Leu Glu Arg Val Glu 
150 155 160 

TAT CTC TTT CTC ATA ATT TTT ACG GTG GAA GCG TTT TTA AAA GTA ATC 762 
Tyr Leu Phe Leu He He Phe Thr Val Glu Ala Phe Leu Lys Val He 
165 170 175 



Will *S/i*4JN;V,;/. 
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GCC TAT GGA CTC CTC TTT CAC CCC AAT GCC TAG CTC CGC AAC GGC TGG 
Ala Tyr Gly Leu Leu Phe His Pro Asn Ala Tyr Leu Arg Asn Gly Trp 

185 190 

AAC CTA CTA GAT TTT ATA ATT GTG GTT QTG GGQ CTT TTT AGT GCA ATT 
Asn Leu Leu Asp Phe He He Val Val Val Gly Leu Phe Ser Ala He 
200 205 210 

Leu Glu Gin Ala Thr Lys Ala Asp Gly Ala Asn Ala Leu Gly Gly Lys 



215 220 



225 



?f 2?^ 3^ ^ AGG GCC TTC CGC GTG CTG CGC 

Gly Ala Gly Phe Asp Val Lys Ala Leu Arg Ala Phe Arg Val Leu ^ 

230 235 - - ^ 



240 



810 



858 



906 



954 



1002 



1050 



1098 



1146 



1194 



CCC CTG CGG CTG GTG TCC GGA GTC CCA AGT CTC CAG GTG GTC CTG AAT 
Pro Leu Arg Leu Val Ser Gly Val Pro Ser Leu Gin Val Val Leu Asn 
245 250 255 

cf ^ tV^ ®f ^ CTG CTG CAC ATC GCC CTG CTT GTG 

Ser He He Lys Ala Met Val Pro Leu Leu His He Ala Leu Leu Val 
2^0 265 270 275 

CTG TTT GTC ATC ATC ATC TAC GCC ATC ATC GGC TTG GAG CTC TTC ATG 
Leu Phe Val He He He Tyr Ala He He Gly Leu Glu Leu Phe Me? 

280 285 290 

GGG AAG ATG CAC AAG ACC TGC TAC AAC CAG GAG GGC ATA GCA GAT GTT 
Gly Lys Met Has Lys Thr Cys Tyr Asn Gin Glu Gly He Ala Asp Val 
295 300 305 

CCA GCA GAA GAT GAC CCT TCC CCT TGT GCG CTG GAA ACG GGC CAC GGG 
Pro Ala Glu Asp Asp Pro Ser Pro Cys Ala Leu Glu Thr Gly His Glv 
310 315 320 

CGG CAG TGC CAG AAC GGC ACG GTG TGC AAG CCC GGC TGG GAT GGT CCC 1242 
Arg Gin Cys Gin Asn Gly Thr Val Cys Lys Pro Gly Trp Asp Gly Pro 
325 330 335 

AAG CAC GGC ATC ACC AAC TTT GAC AAC TTT GCC TTC GCC ATG CTC ACQ 1290 
Lys His Gly He Thr Asn Phe Asp Asn Phe Ala Phe Ala Met Leu Thr 
340 345 350 355 

GTG TTC CAG TGC ATC ACC ATG GAG GGC TGG ACG GAC GTG CTG TAC TGG 1338 
Val Phe Gin Cys He Thr Met Glu Gly Trp Thr Asp Val Leu Tyr Trp 
360 365 370 

GTC AAT GAT GCC GTA GGA AGG GAC TGG CCC TGG ATC TAT TTT GTT ACA 1386 
Val Asn Asp Ala Val Gly Arg Asp Trp Pro Trp He Tyr Phe Val Thr 
375 380 385 

CTA ATC ATC ATA GGG TCA TTT TTT GTA CTT AAC TTG GTT CTC GGT GTG 1434 
Leu He He He Gly Ser Phe Phe Val Leu Asn Leu Val Leu Gly Val 
390 395 400 



1482 



1530 



1626 



1674 



1722 
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5^ e Sf^ ^ ^ ^ ^ ^ AAG GCC AAG GCC CGG GGA 

Leu Ser Gly Glu Phe Ser Lys Glu Arg 61u Lys Ala Lys Ala ^ Gly 
405 415 ^ y 

GAT TTC CAG AAG CTO CGG GAG AAG CAG CAG CTA GAA GAG GAT CTC AAA 
Asp Phe Gin Lys Leu Arg Glu Lys Gin Gin Leu Glu Glu A^ Leu L^ 

*25 430 435 

GGC TAG CTG GAT TOG ATC ACT GAG GCC GAA GAC ATC GAT CCT GAG AAT 1578 
Gly Tyr Leu Asp Trp He Thr Gin Ala Glu Asp He Asp Pro Glu aS 
440 445 450 

GAG GAC GAA GGC ATG GAT GAG GAG AAG CCC C6A AAC ATG AGC ATQ CCr 
Glu Asp Glu Gly Met Asp Glu Glu Lys Pro Arg Asn Met Ser 
455 460 465 

^ ^ ^ ^ AAC GTG GCT GGA GGT GAC 

Thr ser Glu Thr Glu Ser Val Asn Thr Glu Asn Val Ala Gly Gly Asp 
470 475 480 

T? f ^ ^ S?^ ^ ^ CTG GCC CAC CGG ATC TCC AAG 

He Glu Gly Glu Asn Cys Gly Ala Arg Leu Ala His Arg He Ser ^ 
*B5 490 495 

^ SfS ^ '^^ TTC TGC AGA AGG 1770 

Ser Lys Phe Ser Arg Tyr Tip Arg Arg Trp Asn Arg Phe Cys Arg Arg 

505 510 515 

AAG TGC CGC GCC GCA GTC AAG TCT AAT GTC TTC TAC TGG CTG GTG ATT 1818 
Lys Cys Arg Ala Ala Val Lys Ser Asn val Phe Tyr Trp Leu Val lie 

520 525 530 

TTC CTG GTG TTC CTC AAC ACG CTC ACC ATT GCC TCT GAG CAC TAC AAC 
Phe Leu Val Phe Leu Asn Thr Leu Thr He Ala Ser Glu His Tyr Asn 
535 540 545 

CAG CCC AAC TGG CTC ACA GAA GTC CAA GAC ACG GCA AAC AAG GCC CTG 1914 
Gin Pro Asn Trp Leu Thr Glu Val Gin Asp Thr Ala Asn Lys Ala Leu 
550 555 5gQ 

CTG GCC CTG TTC ACG GCA GAG ATG CTC CTG AAG ATG TAC AGC CTG GGC 1962 
Leu Ala Leu Phe Thr Ala Glu Met Leu Leu Lys Met Tyr Ser Leu Gly 

565 570 575 ' 

CTG CAG GCC TAC TTC GTG TCC CTC TTC AAC CGC TTT GAC TGC TTC GTC 2010 
Leu Gin Ala Tyr Phe Val Ser Leu Phe Asn Arg Phe Asp Cys Phe Val 
580 585 590 595 

GTG TCT GGC GGC ATC CTG GAG ACC ATC CTG GTC GAG ACC AAG ATC ATO 2058 
Val Cys Gly Gly He Leu Glu Thr He Leu Val Glu Thr Lys He Met 
600 605 610 

TCC CCA CTG GGC ATC TCC GTG CTC AGA TGC GTC COG CTC CTC AGG ATT 2106 
Ser Pro Leu Gly He Ser Val Leu Arg Cys Val Arg Leu Leu Arg He 
€15 620 625 



1866 
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ATC ^ TAC raa JJUC TCC TIG JiGC MC CTG GTG GCA TCC 
Phe Lys !!• Thr Arg Tyr Trp Asa Ser Leu Ser Asn Leu Val Sa Ser 
630 



640 



WG CTG AAC TCT GTG CGC TCC ATC GCC TCC CTG CTC CTT CTC CTC TTC 
Leu Leu Asn Ser Val Arg Ser He Ala Ser Leu Leu Lei 2u 2S 

650 655 

2S Se ill « f '^'^ c^'' CTG GGG ATC CAG CTC TTT GGA GGA 

Leu Phe He He He Phe Ser Leu Leu Gly Met Gin Leu Phe Gly Gly 

665 670 675 

W« ^ ^ ^ AGG AGC ACA TTC GAT AAC 

Lys Phe Asn Phe Asp Glu Met Gin Thr Arg Arg Ser Thr Phe Asp Asn 
680 6B5 690 

S^f e^^ CTC CTC ACT GTG TTT CAG ATC CTG ACC GGG GAG GAC 

Phe Pro Gin Ser Leu Leu Thr Val Phe Gin He Leu Thr Gly Glu Asp 
695 700 705 

TCG AAT TCG GTG ATG TAT GAT GGG ATC ATG GCT TAT GGC GGC CCC TCT 
Trp Asn Ser Val Met Tyr Asp Gly He Met Ala Tyr Gly Gly Pro Ser 

'J-O 715 720 

^ 2?^ S'^ A" TTC ATC ATC CTC TTC ATC TCT 

Phe Pro Gly Met Leu Val Cys He Tyr Phe He He Leu Phe He Ss 

730 735 

GGA AAC TAT ATC CTA CTG AAT GTG TTC TTG GCC ATT GCT GTG GAC AAC 
Gly Asn Tyr He Leu Leu Asn Val Phe Leu Ala He Ala Val Asp Asn 

745 750 755 

?P ^ TCT GCC CAA AAG GAG GAG GAA GAG 

Leu Ala Asp Ala Glu Ser Leu Thr Ser Ala Gin Lys Glu Glu Glu Glu 
760 765 770 

S?f S?® ^ ^ ^ ^ ACT GCC AGC CCA GAG AAG AAA 2586 

Glu Lys Glu Arg Lys Lys Leu Ala Arg Thr Ala Ser Pro Glu Lys ^ 
775 780 785 

CAA GAG TOG GTG GM AAG CCG GCA GTG GGG GAA TCC ^ 2634 
Gin Glu Leu Val Glu Lys Pro Ala Val Gly Glu Ser Lys Glu Glu Lvs 
790 795 800 

ATT GAG CTG AAA TCC ATC ACG GCT GAC GGA GAG TCT CCA CCC GCC ACC 
He Glu Leu Lys Ser He Thr Ala Asp Gly Glu Ser Pro Pro Ala Thr 
805 810 815 

AAG ATC AAC ATG GAT GAC CTC CAG CCC AAT GAA AAT GAG GAT AAG AGC 2730 
Lys He Asn Met Asp Asp Leu Gin Pro Asn Glu Asn Glu Asp Lys Ser 
820 825 830 835 

CCC TAC CCC AAC CCA GAA ACT ACA GGA GAA GAG GAT GAG GAG GAG CCA 2778 
Pro Tyr Pro Asn Pro Glu Thr Thr Gly Glu Glu Asp Glu Glu Glu Pro 
840 845 850 



2154 



2202 



2250 



2298 



2346 



2394 



2442 



2490 



2538 



2682 
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GAG ATG CCT GTC GGC CCT CGC CCA CGA CC31 CTC TCT GAG CTT CAC CTT 2826 
Glu Met Pro Val Gly Pro Arg Pro Arg Pro Leu Ser Glu Leu His Leu 
855 860 865 

AAG GAA AAG GCA GTG CCC ATG CCA GAA GCC AGC GCG TTT TTC ATC TTC 2874 
Lys Glu Lys Ala Val Pro Met Pro Glu Ala Ser Ala Phe Phe lie Phe 
870 875 880 

AGC TCT AAC AAC AGG TTT CGC CTC CAG TGC CAC CGC ATT GTC AAT GAC 2922 
f «f ^ ^ Cys His Arg He Val Aan Asp 

885 890 895 

ACG ATC TTC ACC AAC CTG ATC CTC TTC TTC ATT CTG CTC AGC AGC ATT 2970 
Thr lie Pbe Thr Asn Leu He Leu Phe Phe He Leu Leu Ser Ser He 
^00 905 910 915 

TCC CTG GCT OCT GAG GAC CCG GTC CAG CAC ACC TCC TTC AGG AAC CAT 3018 
Ser Leu Ala Ala Glu Asp Pro Val Gin His Thr Ser Phe Arg Asn His 
920 925 930 

ATT CTG TTT TAT TTT GAT ATT GTT TTT ACC ACC ATT TTC ACC ATT GAA 3066 
He Leu Phe Tyr Phe Asp He Val Phe Thr Thr He Phe Thr He Glu 
935 940 945 

ATT GCT CTG AAG ATG ACT GCT TAT GGG GCT TTC TTG CAC AAG GGT TCT 3114 
He Ala Leu Lys Met Thr Ala Tyr Gly Ala Phe Leu His Lys Gly Ser 
»50 955 960 

TTC TGC CGG AAC TAC TTC AAC ATC CTG GAC CTG CTG GTG GTC AGC GTG 3162 
Phe Cys Arg Asn Tyr Phe Asn He Leu Asp Leu Leu Val Val Ser Val 
965 970 975 

TCC CTC ATC TCC TTT GGC ATC CAG TCC AGT GCA ATC AAT GTC GTG AAG 3210 
Ser Leu He Ser Phe Gly He Gin Ser Ser Ala He Asn Val Val Lys 
^80 985 990 995 

ATC TTG CGA GTC CTG CGA GTA CTC AGG CCC CTG AGG GCC ATC AAC AGG 3258 
He Leu Arg Val Leu Arg Val Leu Arg Pro Leu Arg Ala He Asn Arg 
1000 1005 1010 

GCC AAG GGG CTA AAG CAT GTG GTT CAG TGT GTG TTT GTC GCC ATC CGG 3306 
Ala Lys Gly Leu Lys His Val Val Gin Cys Val Phe Val Ala He Arg 
1015 1020 1025 

ACC ATC GGG AAC ATC GTG ATT GTC ACC ACC CTG CTG CAG TTC ATG TTT 3354 
Thr He Gly Asn He Val He Val Thr Thr Leu Leu Gin Phe Met Phe 
1030 1035 1040 

GCC TGC ATC GGG GTC CAG CTC TTC AAG GGA AAG CTG TAC ACC TGT TCA 3402 
Ala Cys He Gly Val Gin Leu Phe Lys Gly Lys Leu Tyr Thr Cys Ser 
1045 1050 1055 

GAC AGT TCC AAG CAG ACA GAG GCG GAA TGC AAG GGC AAC TAC ATC ACG 3450 
Asp Ser Ser Lys Gin Thr Glu Ala Glu Cys Lys Gly Asn Tyr He Thr 
^OSO 1065 1070 1075 
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TAC AAA GAC GGG GAG GTT GAC CAC CCC ATC ATC CAA CCC CGC AGC TGG 
Tyr Lys Asp Gly Glu Val Asp His Pro He He Gin Pro S Ser S 
"80 1085 1090 

S?® ^ ^ AAT GTT CTG GCA GCC ATG ATG GCC 

Glu Asn ser Lys Phe Asp Phe Asp Asn Val Leu Ala Ala Met m2 
1095 1100 1105 

CTC TTC ACC GTC TCC ACC TTC GAA GGG TGG CCA GAG CTG CT6 TAG CGC 
Leu Phe Thr Val Ser Thr Phe Glu Gly Trp Pro Glu Leu Leu Sr 
^110 1115 1120 

If ^ T^f ^ ACG GAA GAC AAG QGC CCC ATC TAG AAC TAG CGT 

Ser lie Asp Ser His Thr Glu Asp Lys Gly Pro He Tyr Asn Tyr Arg 
1125 1130 3^235 ^ » 

CTG GAG ATC TCC ATC TTC TTC ATC ATC TAC ATC ATC ATC ATC GCC TTC 
Val Glu He Ser He Phe Phe He He Tyr He He He lie Ma ^e 

1145 1150 1155 

TTC ATG ATG AAC ATC TTC GTG GGC TTC GTC ATC GTC ACC TTT CAG GAG 
Phe Met Met Asn He Phe Val Gly Phe Val He Val Thr Phe Gin ^ 
11^0 1165 ^^jQ 

CAG GGG GAG CAG GAG TAC AAG AAC TGT GAG CTG GAC AAG AAC CAG CGA 
Om Gly Glu Gin Glu Tyr Lys Asn Cys Glu Leu A^ ^ gS S 
1175 1180 1185 

^ vl? ^ ^ CTG CGG AGG TAC ATC 

Gin Cys Val Glu Tyr Ala Leu Lys Ala Arg Pro Leu Arg Arg Tyr He 
11^0 1195 1200 

CCC AAG AAC CAG CAC CAG TAC AAA GTG TGG TAC GTG GTC AAC TCC ACC 
Pro Lys Asn Gin His Gin Tyr Lys Val Trp Tyr Val Val Asn Ser Thr 
1205 1210 1215 

TAC TTC GAG TAC CTG ATG TTC GTC CTC ATC CTG CTC AAC ACC ATC TGC 
Tyr Phe Glu Tyr Leu Met Phe Val Leu He Leu Leu Asn Thr He Cvs 
1220 1225 1230 1235 

CTG GCC ATG CAG CAC TAC GGC CAG AGC TGC CTG TTC AAA ATC GCC ATG 
Leu Ala Met Gin His Tyr Gly Gin Ser Cys Leu Phe Lys He Ala Met 
1240 1245 1250 

AAC ATC CTC AAC ATG CTC TTC ACT GGC CTC TTT ACC GTG GAG ATG ATC 
Asn He Leu Asn Met Leu Phe Thr Gly Leu Phe Thr Val Glu Met He 
1255 1260 1265 

CTG AAG CTC ATT GCC TTC AAA CCC AAG CAC TAT TTC TGT GAT GCA TGG 
Leu Lys Leu He Ala Phe Lys Pro Lys His Tyr Phe Cys Asp Ala Trp 
1270 1275 1280 

AAT ACA TTT GAC GCC TTG ATT GTT GTG GGT AGC ATT GTT GAT ATA GCA 
Asn Thr Phe Asp Ala Leu He Val Val Gly Ser He Val Asp He Ala 
1285 1290 1295 



3498 



3546 



3594 



3642 



3690 



3738 



3786 



3834 



3882 



3930 



3978 



4026 



4074 



4122 



.266^ 



ATC ACC GAG GTA AAC CCA GCT GAA CAT ACC CAA TGC TCT CCC TCT ATG 4170 
He Thr Glu Val Asn Pro Ala Glu His Thr Gin Cys Ser Pro Ser Met 
"00 1305 1310 1315 

AAC GCA GAG GAA AAC TCC CGC ATC TCC ATC ACC TTC TTC CGC CTG TTC 4218 
Asn Ala Glu Glu Asn Ser Arg He Ser He Thr Phe Phe Arg Leu Phe 
1320 1325 1330 

CGG 6TC ATG CGT CTG GT6 AAG CTG CTG AGC CGT GGG GAG GGC ATC CGG 4266 
Arg Val Met Arg Leu Val Lys Leu Leu Ser Arg Gly Glu Gly He Arg 
1335 1340 1345 

ACG CTG CTG TGG ACC TTC ATC AAG TCC TTC CAG GCC CTG CCC TAT GTG 4314 
Thr Leu Leu Trp Thr Phe He Lys Ser Phe Gin Ala lieu Pro Tyr Val 
1350 1355 1360 

GCC CTC CTG ATC GTG ATG CTG TTC TTC ATC TAC GCG GTG ATC GGG ATG 4362 
Ala Leu Leu He Val Met Leu Phe Phe He Tyr Ala Val He Gly Met 
1365 1370 1375 

CAG GTG TTT GGG AAA ATT GCC CTG AAT GAT ACC ACA GAG ATC AAC CGG 4410 
Gin Val Phe Gly Lys He Ala Leu Asn Asp Thr Thr Glu He Asn Arg 
"80 1385 1390 1395 

AAC AAC AAC TTT CAG ACC TTC CCC CAG GCC GTG CTG CTC CTC TTC AGG 4458 
Asn Asn Asn Phe Gin Thr Phe Pro Gin Ala val Leu Leu Leu Phe Arg 
1400 1405 1410 

TGT GCC ACC GGG GAG GCC TGG CAG GAC ATC ATG CTG GCC TGC ATG CCA 4506 
Cys Ala Thr Gly Glu Ala Trp Gin Asp He Met Leu Ala Cys Met Pro 
1415 1420 1425 

GGC AAG AAG TGT GCC CCA GAG TCC GAG CCC AGC AAC AGC ACG GAG GGT 4554 
Gly Lys Lys Cys Ala Pro Olu Ser Glu Pro Ser Asn Ser Thr Glu Gly 
1430 1435 1440 

GAA ACA CCC TGT GGT AGC AGC TTT GCT GTC TTC TAC TTC ATC AGC TTC 4602 
Glu Thr Pro Cys Gly Ser Ser Phe Ala Val Phe Tyr Phe He Ser Phe 
1445 1450 1455 

TAC ATG CTC TGT GCC TTC CTG ATC ATC AAC CTC TTT GTA GCT GTC ATC 4650 
Tyr Met Leu Cys Ala Phe Leu He He Asn Leu Phe Val Ala Val He 
1460 1465 1470 1475 

ATG GAC AAC TTT GAC TAC CTG ACA AGG GAC TGG TCC ATC CTT GGT CCC 4698 
Met Asp Asn Phe Asp Tyr Leu Thr Arg Asp Trp Ser He Leu Gly Pro 
1480 1485 1490 

CAC CAC CTG GAT GAG TTT AAA AGA ATC TGG GCA GAG TAT GAC CCT GAA 4746 
His His Leu Asp Glu Phe Lys Arg He Trp Ala Glu Tyr Asp Pro Glu 
1495 1500 1505 



GCC AAG GGT CGT ATC AAA CAC CTG GAT GTG GTG ACC CTC CTC CGG CGG 
Ala Lys Gly Arg He Lys His Leu Asp Val Val Thr Leu Leu Arg Arg 
1510 1515 1520 



4794 
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ATT CAG CCG CCA CTA GGT TTT GGG AAO CTG TGC CCT CAC CGC GTG GOT 4842 
lie Gin Pro Pro Leu Gly Phe Gly Lys Leu Cys Pro His Arg Val Ala 
1525 1530 1535 

TGC AAA CGC CTG GTC TCC ATG AAC ATG CCT CTG AAC AGC GAC GGG ACA 4890 
cys Lys Arg Leu Val Ser Met Asn Met Pro Leu Asn Ser Asp Gly Thr 
«40 1545 1550 1555 

GTC ATG TTC AAT GCC ACC CTG TTT GCC CTG GTC AGG ACG GCC CTG AGG 4938 
Val Met Phe Asn Ala Thr Leu Phe Ala Leu Val Arg Thr Ala Leu Arg 
1560 1565 1570 

ATG AAA ACA GAA GGG AAC CTA GAA CAA GCC AAT GAG GAG CTG CGG GCG 4986 
He Lys Thr Glu Gly Asn Leu Glu Gin Ala Asn Glu Glu Leu Arg Ala 
1575 1580 1585 

ATC ATC AAG AAG ATC TGG AAG CGG ACC AGC ATG AAG CTG CTG GAC CAG 5034 
lie lie Lys Lys He Tzp Lys Arg Thr Ser Met Lys Leu Leu Asp Gin 
1590 1595 1600 

GTG GTG CCC CCT GCA GGT GAT GAT GAG GTC ACC GTT GGC AAG TTC TAG 5082 
Val Val Pro Pro Ala Gly Asp Asp Glu Val Thr Val Gly Lys Phe Tyr 
1605 1610 1615 

GCG ACG TTC CTG ATC CAG GAG TAG TTC GGG AAG TTC AAG AAG CGC AAA 5130 
Ala Thr Phe Leu He Gin Glu Tyr Phe Arg Lys Phe Lys Lys Arg Lys 
1620 1625 1630 1635 

GAG CAG GGC CTT GTG GGC AAG CCC TCC GAG AGG AAC GCG CTG TCT CTG 5178 
Glu Gin Gly Leu Val Gly Lys Pro Ser Gin Arg Asn Ala Leu Ser Leu 
1640 1645 1650 

CAG GCT GGC TTG CGC ACA CTG CAT GAC ATC GGG CCT GAG ATC CGA CGG 5226 
Gin Ala Gly Leu Arg Thr Leu His Asp He Gly Pro Glu He Arg Arg 
1655 1660 1665 

GCC ATC TCT GGA GAT GTC ACG GCT GAG GAG GAG GTG GAC AAG GCC ATG 5274 
Ala He Ser Gly Asp Leu Thr Ala Glu Glu Glu Leu Asp Lys Ala Met 
1670 1675 1680 

AAG GAG GCT GTG TCC GGT GCT TCT GAA GAT GAC ATC TTC AGG AGG GCC 5322 
Lys Glu Ala Val Ser Ala Ala Ser Glu Asp Asp He Phe Arg Arg Ala 
1685 1690 1695 

GGT GGC CTG TTC GGC AAC CAC GTC AGC TAG TAG CAA AGC GAC GGC CGG 5370 
Gly Gly Leu Phe Gly Asn His Val Ser Tyr Tyr Gin Ser Asp Gly Arg 
1700 1705 1710 1715 

AGC GCC TTC CCC CAG AGC TTC ACC ACT CAG CGC CCG CTG CAC ATG AAC 5418 
Ser Ala Phe Pro Gin Thr Phe Thr Thr Gin Arg Pro Leu His He Asn 
1720 1725 1730 

AAG GCG GGC AGC AGC CAG GGC GAC ACT GAG TGG CCA TCC CAC GAG AAG 5466 
Lys Ala Gly Ser Ser Gin Gly Asp Thr Glu Ser Pro Ser His Glu Lys 
1735 1740 1745 
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CTG GTG GAC TCC ACC TTC ACC CCG AGC AGC TAC TCG TCC ACC GGC TCC 5514 
Leu Val Asp Ser Thr Phe Thr Pro Ser Ser Tyr Ser Ser Thr Gly Ser 
1750 1755 1760 

AAC GCC AAC ATC AAC AAC GCC AAC AAC ACC GCC CTG GGT CGC CTC OCT 5562 
Asn Ala Asn He Asn Asn Ala Asn Asn Thr Ala Leu Gly Arg Leu Pro 
1765 1770 1775 

CGC CCC GCC GGC TAC CCC AGC ACG GTC AGC ACT GTG GAG GGC CAC 6GG 5610 
Arg Pro Ala Gly Tyr Pro Ser Thr Val Ser Thr Val Glu Gly His Gly 
1780 1785 1790 1795 

CCC CCC TTG TCC CCT GCC ATC CGG GTG CAG GAG GTG GCG TGG AAG CTC 5658 
Pro Pro Leu Ser Pro Ala He Arg Val Gin Glu Val Ala Trp Leu 
1800 1805 1810 

AGC TCC AAC AGG TGC CAC TCC CGG GAG AGC CAG GCA GCC ATG GCG GGT 5706 
Ser Ser Asn Arg Cys His Ser Arg Glu Ser Gin Ala Ala Met Ala Gly 
1815 1820 1825 

^ ^ ii^® ^ TA'^ «5TG AAG ATG AAC CAT 5754 

Gin Glu Glu Thr Ser Gin Asp Glu Thr Tyr Glu Val Lys Met Asn His 
1830 1835 1840 

^ ^ ^® AGC CTG CTC TCC ACA GAG ATG CTC 5802 

Asp Ito Glu Ala Cys Ser Glu Pro Ser Leu Leu Ser Thr Glu Met Leu 
1845 1850 1855 

GAC GAA AAT CGG CAA CTG ACG CTC CCA GAG GAG GAC 5850 
^t^J^ ^» Arg Gin Leu Thr Leu Pro Glu Glu Asp 

I860 1865 1870 1875 

AAG AGG GAC ATC CGG CAA TCT CCG AAG AGG GGT TTC CTC CGC TCT GCC 5898 
Lys Arg Asp He Arg Gin Ser Pro Lys Arg Gly Phe Leu Arg Ser Ala 
1880 1885 1890 

TCA CTA GGT CGA AGG GCC TCC TTC CAC CTG GAA TGT CTG AAG CGA CAG 5946 
Ser Leu Gly Arg Arg Ala Ser Phe His Leu Glu Cys Leu Lys Arg Gin 
1895 1900 1905 

AAG GAC CGA GGG GGA GAC ATC TCT CAG AAG ACA GTC CTG CCC TTG CAT 5994 
Lys Asp Arg Gly Gly Asp He Ser Gin Lys Thr Val Leu Pro Leu His 
1510 1915 1920 

CTG GTT CAT CAT CAG GCA TTG GCA GTG GCA GGC CTG AGC CCC CTC CTC 6042 
Leu Val His His Gin Ala Leu Ala Val Ala Gly Leu Ser Pro Leu Leu 
1925 1930 1935 

CAG AGA AGC CAT TCC CCT GCC TCA TTC CCT AGG CCT TTT GCC ACC CCA 6090 
Gin Arg Ser His Ser Pro Ala Ser Phe Pro Arg Pro Phe Ala Thr Pro 
1^40 1945 1950 1955 

CCA GCC ACA CCT GGC AGC CGA GGC TGG CCC CCA CAG CCC GTC CCC ACC 6138 
Pro Ala Thr Pro Gly Ser Arg Gly Trp Pro Pro Gin Pro Val Pro Thr 
1960 1965 1970 
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CTG CGG CTT GAG GGG GTC GAG TCC AGT GAG AAA CTC AAC AGC AGC TTC 618fi 
Leu Arg Leu Qlu Gly Val Glu Ser Ser Glu Lys Leu Asn Ser Ser Phe 
1^75 1980 1985 

CCA TCC ATC CAC TGC GGC TCC TGG GCT GAG ACC ACC CCC GGT GGC GGG 6234 
Pro Ser lie His Cys Gly Ser Trp Ala Glu Thr Thr Pro Gly Gly Glv 
19^0 . 1995 2000 ^ ^ ^ 

GGC AGC AGC GCC GCC CGG AGA GTC CGG CCC GTC TCC CTC ATG GTG CCC 6282 
Gly Ser Ser Ala Ala Arg Arg Val Arg Pro Val Ser Leu Met Val Pro 
2005 2010 2015 

AGC CAG GCT GGG GCC CCA GGG AGG CAQ TTC CAC GGC AGT GCC AGC AGC 6330 
Ser Gin Ala Gly Ala Pro Gly Arg Gin Phe His Gly Ser Ala Ser Ser 
2020 2025 2030 2035 

?^ QAA GCG GTC TTG ATT TCA GAA GGA CTG GGG CAG TTT GGT CAA 6378 
Leu Val Glu Ala Val Leu He Ser Glu Gly Leu Gly Gin Phe Ala Gin 
2040 2045 2050 

GAT CCC AAG TTC ATC GAG GTC ACC ACC CAG GAG CTC GCC GAC GCC TGC 6426 
Asp Pro Lys Phe He Glu Val Thr Thr Gin Glu Leu Ala Asp Ala Cys 
2055 2060 2065 

GAC ATG ACC ATA GAG GAG ATG GAG AGC GCG GCC GAC AAC ATC CTC AGC 6474 
Asp Met Thr He Glu Glu Met Qlu Ser Ala Ala Asp Asn He Leu Ser 
2070 2075 2080 

GGG GGC GCC CCA CAG AGC CCC AAT GGC GCC CTC TTA CCC TTT GTG AAC 6522 
Gly Gly Ala Pro Gin Ser Pro Asn Gly Ala Leu Leu Pro Phe Val Asn 
2085 2090 2095 

TCC AGG GAC GCG GGG CAG GAC CGA GCC GGG GGC GAA GAG GAC GCG GGC 6570 
Cys Arg Asp Ala Gly Gin Asp Arg Ala Gly Gly Glu Glu Asp Ala Gly 
2100 2105 2110 2115 

TGT GTC CGC GCG CGG GGT CGA CCG AGT GAG GAG GAG CTC CAG GAC AGC 6618 
Cys Val Arg Ala Arg Gly Arg Pro Ser Glu Glu Glu Leu Gin Asp Ser 
2120 2125 2130 

AGG GTC TAC GTC AGC AGC CTC TAGTGGGCGC TCCCAGAT6C GGGCTTTTTT 6669 
Arg Val Tyr Val Ser Ser Leu 
2135 

TTATTTGTTT CAATGTTCCT AATGGGTTCG TTTCAGAAGT GCCTCACTCT TCTCGT 6725 



(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



-270- 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 502.. 2316 

(D) OTHER INFORMATION: /standard_name= "Beta-2C" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CAGCAGCGTG CTAAGAAOCA GTCACATAAA CAGCAGCAGG AGTAGGCCTC CTGCTTTTCA 60 

AAAGCAGAGT ACTGCAGGGT CGCGAAATGC AAQACACTCA GATGTTTGAA AATCTCCCGA 120 

GTTGAGAATG GCTACTGTAA AAGCGTCACC AAGAAACTCT GACGATCTG6 ACAGTCCTAA 180 

CTCTGTGTTA GCAATACTTA CTTCCGQAAA ATTAAT6CTA CTTCTTGTAG ATTTTTGCAA 240 

ATAGGAAACC CCCTTGAAGA AGATCTCAAA TTACGCCCCC CACCCCCAAA AAAAGACAAA 300 

CAGGGGAGAA CAAAGTTTTG GCATGCCTGC AGGAACGGTG GCTTTTTTAG AAACTACCTA 360 

GGAGGCAGAA GCTAAGTGAT TTGCTCATGC CTCTTACCTG GGAGTAGAAQ GTGGGAAGAA 420 

ATGGACCGAG 6CTGTGACGA GAAGACAAGG CACAGTGCAG CTTGGTGAAG CCACACGCTG 480 

ACTGCGTTCT GCCCCCTCTT C ATG CAG TGC TGC GGG CTG GTG CAT CGC CGG 531 

Met Gin Cys Cys Qly Leu Val His Arg Arg 
15 10 

CGA GTA CGG GTG TCC TAT GGT TCG GCA GAC TCC TAC ACT AGC CGT CCA 579 
Arg Val Arg Val Ser Tyr Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro 
15 20 25 

TCC GAT TCC GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG CGC AGA 627 
Ser Asp Ser Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg Arg 
30 35 40 

GAA GCG GAG CGG CAG GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA AAG 675 
Glu Ala Glu Arg Gin Ala Gin Ala Gin Leu Glu Lys Ala Lys Thr Lys 
45 50 55 

CCC GTT GCA TTT GCG GTT CGG ACA AAT QTC AGC TAC AGT GCG GCC CAT 723 
Pro Val Ala Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala His 
60 65 70 

GAA GAT GAT GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA MA 771 
Glu Asp Asp Val Pro Val Pro Gly Met Ala He Ser Phe Glu Ala Lys 
75 80 85 90 

GAT TTT CTG CAT GTT AAG GAA AAA TTT AAC AAT GAC TOG TGG ATA GGG 819 
Asp Phe Leu His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp He Gly 
95 100 105 

CGA TTG GTA AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA 6TC 867 
Arg Leu Val Lys Glu Gly Cys Glu He Gly Phe He Pro Ser Pro Val 
110 115 120 
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AAA CTA GAA AAC ATG AG6 CTG CAG CAT GAA GAG AGA GCC AAG CAA GGG 915 
Lya Leu Glu Asn Met Aig Leu Gin His Glu Gin Arg Ala Lys Gin Gly 
125 130 135 

AAA TTC TAC TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG GGT 963 
Lya Phe Tyr Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu Gly 
140 145 ISO 

GAC ATA GTA COT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT GCT ATA 1011 
Asp lie Val Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala lie 
155 160 165 170 

GAC ATA GAT GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA GCA 1059 
Asp lie Asp Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp lie Pro Ala 
175 180 185 

AAC CAC CGC TCC CCT AAA CCC AGT GCA AAC AGT GTA ACG TCA CCC CAC 1107 
Asn His Arg Ser Pro Lys Pro Ser Ala Asn Ser Val Thr Ser Pro His 
150 195 200 

TCC AAA GAG AAA AGA ATG CCC TTC TTT AAG AAG ACA GAG CAC ACT CCT 1155 
Ser Lys Glu Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr Pro 
205 210 215 

CCG TAT GAT GTG GTA CCT TCC ATG CGA CCA GTG GTC CTA GTG GGC CCT 1203 
Pro Tyr Asp Val Val Pro Ser Met Arg Pro Val Val Leu Val Gly Pro 
220 225 230 

TCT CTG AAG GGC TAC GAG GTC ACA GAT ATG ATG CAA AAA GCG CTG TTT 1251 
Ser Leu Lys Gly Tyr Glu Val Thr Asp Met Met Gin Lys Ala Leu Phe 
235 240 245 250 

GAT TTT TTA AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG GTC 1299 
Asp Phe Leu Lys His Arg Phe Glu Gly Arg He Ser He Thr Arg Val 
255 260 265 

ACC GCT GAC ATC TCG CTT GCC AAA CGC TCG GTA TTA AAC AAT CCC AGT 1347 
Thr Ala Asp He Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser 
270 275 280 

AAG CAC GCA ATA ATA GAA AGA TCC AAC ACA AGG TCA AGC TTA GCG GAA 1395 
Lys His Ala He He Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu 
285 290 295 

GTT CAG AGT GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG CAG 1443 
Val Gin Ser Glu He Glu Arg He Phe Glu Leu Ala Arg Thr Leu Gin 
300 305 310 

TTG GTG CTC CTT GAC GCG GAT ACA ATT AAT CAT CCA GCT CAA CTC AGT 1491 
Leu Val Val Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu Ser 
315 320 325 330 

AAA ACC TCC TTG GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT CCT 1539 
Lys Thr Ser Leu Ala Pro He He Val Tyr Val Lys He Ser Ser Pro 
335 340 345 
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AAG GTT TTA CAA AGG TTA ATA AAA TCT CGA GGG AAA TCT CAA GCT AAA 1587 
Lys Val lieu Gin Arg Leu He Lys Ser Arg Gly Lys Ser Gin Ala Lys 
350 355 360 

CAC CTC AAC GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT CCT 1635 
His Leu Asn Val Gin Met Val Ala Ala Asp Lys Leu Ala Gin Cys Pro 
365 370 375 

CCA GAG CTG TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT GCC 1683 
Pro Glu Leu Pbe Asp Val He Leu Asp Glu Asn Gin Leu Glu Asp Ala 
380 385 390 

TGT GAG CAC CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC CAT 1731 
Cys Glu His Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His 
395 400 405 410 

CCT CCC AGC AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA GCC 1779 
Pro Pro Ser Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu Ala 
415 420 425 

ACT TCA AGT CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG GGT 1827 
Thr Ser Ser Leu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gin Gly 
430 435 440 

TCT CAA GGT GAT CAG AGG ACT GAT CGC TCC GCT CCT ATC CGT TCT GCT 1875 
Ser Gin Gly Asp Gin Arg Thr Asp Arg Ser Ala Pro He Arg Ser Ala 
445 450 455 

TCC CAA GCT 6AA GAA GAA CCT AGT GTG GAA CCA GTC AAG AAA TCC CAG 1923 
Ser Gin Ala Glu Glu Glu Pro Ser Val Glu Pro Val Lys Lys Ser Gin 
460 465 470 

CAC CGC TCT TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT GGG ACA 1971 
His Arg Ser Ser Ser Ser Ala Pro His His Asn His Arg Ser Gly Thr 
475 480 485 490 

AGT CGC GGC CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG GAG 2019 
Ser Aig Gly Leu Ser Arg Gin Glu Thr Phe Asp Ser Glu Thr Gin Glu 
495 500 505 

AGT CGA GAC TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT GAC 2067 
Ser Arg Asp Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His Asp 
510 515 520 

CAC GTG GAC CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC AGA GAC GAG 2115 
His Val Asp His Tyr Ala Ser His Arg Ajsp His Asn His Arg Asp Glu 
525 530 535 

ACC CAC GGG AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT TCC 2163 
Thr His Gly Ser Ser Asp Bis Arg His Arg Glu Ser Arg His Arg Ser 
540 545 550 

CGG GAC GTG GAT CGA GAG CAG GAC CAC AAC GAG TGC AAC AAG CAG CGC 2211 
Arg Asp Val Asp Arg Glu Gin Asp His Asn Glu Cys Asn Lys Gin Arg 
555 560 565 570 
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AGC COT CAT AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA GTG 2259 
Ser Arg His Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu Val 
575 580 585 

ATA TCA AAA AAA CGG AAT GAG GOT GGG GAG TGG AAC AGG GAT GTT TAC 2307 
He Ser Lys Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val Tyr 
590 595 600 

ATC CCC CAA TGAGTTTTGC CCTTTTGTGT T lTnTlTl'T TTTTTTTTGA 2356 
He Pro Gin 
605 

AGTCTTGTAT AACTAACAGC ATCCCCAAAA CAAAAAGTCT TTGGGGTCTA CACTCCAATC 2416 

ATATGTGATC TGTCTTGTAA TATTTTGTAT TATTGCTGTT GCTTGAATAG CAATAGCATG 2476 

GATAGAGTAT TGAGATACTT TTTCTTTTGT AAGTGCTACA TAAATTGGCC TGGTATGGCT 2536 

GCAGTCCTCC GGTTOCATAC TGGACTCTTC AAAAACTGTT TTGGGTAGCT GCCACTTGAA 2596 

CAAAATCTGT TGCCACCCAG GTGATGTTAG TGTTTTAAGA AATGTAGTTG ATGTATCCAA 2656 

CAAGCCAGAA TCAGCACAGA TAAAAAGTGG AATTTCTTGT TTCTCCAGAT TTTTAATACG 2716 

TTAATACGCA GGCATCTGAT TTGCATATTC ATTCATGGAC CACTGTTTCT TGCTTCTACC 2776 

TCTGGCTGAC TAAATTTGGG GACAGATTCA GTCTTGCCTT ACACAAAGGG GATCATAAAG 2836 

TTAGAATCTA TTTTCTATGT ACTAGTACTG TGTACTGTAT AGACAGTTTG TAAATGTTAT 2896 

TTCTGCAAAC AAACACCTCC TTATTATATA TAATATATAT ATATATATCA GTTTGATCAC 2956 
ACTATTTTAG AQTC 

(2) INFORMATION FOR SEQ ID NO: 38: 



2970 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2712 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 223.. 2061 

(D) OTHER INFORMATION: /standardj:iame« "Beta-2E" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
A6TGTGTGTT TTCAGCCCCT CCTGGAATGG GAAAATAAGA ATCTCCCT6G ATGGGA6TCC 60 
TCTGGGGCAG GGAGTGAAAG CCCCGGAGQC AGAAAGGGAC 6GAGAACAGG GGCTTGCCCA 120 
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GAGCATGGAT AGGAAAGGAG CTGGGGTTCT CCGGGGCTCA GCGCGCACTG AGAACCTGTG 180 

CCCGGGGCTG CAGCTGCGGA CGATAAAGGC GCTGTCTGGC TC ATG AA6 GCC ACC 234 

Met Lys Ala Tbr 
1 

TGG ATC A6G CTT CTG AAA AGA GCC AAG G6A GGA AGG CTG AAG AAT TCT 282 
Trp lie Arg Leu Leu Lys Arg Ala Lys Gly Gly Arg Leu Lys Asn Ser 
5 10 15 20 

GAT ATC TOT GGT TCG OCA GAC TCC TAG ACT AGC CGT CCA TCC GAT TCC 330 
Asp lie Cys Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser 

25 30 35 

GAT GTA TCT CTG GAG GAG GAC CGG GAG GCA GTG C6C AGA GAA GCG GAG 378 
Asp Val Ser Leu Glu Glu Asp Arg Glu Ala Val Arg Arg Glu Ala Glu 
40 45 50 

CGG CA6 GCC CAG GCA CAG TTG GAA AAA GCA AAG ACA AAG CCC GTT GCA 426 
Arg Gin Ala Gin Ala Gin Leu Glu Lys Ala Lys Thr Lys Pro Val Ala 
55 60 65 

TTT GCG GTT CGG ACA AAT GTC AGC TAC AGT GCG GCC CAT GAA GAT GAT 474 
Phe Ala Val Arg Thr Asn Val Ser Tyr Ser Ala Ala His Glu Asp Asp 
70 75 80 

GTT CCA GTG CCT GGC ATG GCC ATC TCA TTC GAA GCA AAA GAT TTT CTG 522 
Val Pro Val Pro Gly Net Ala lie Ser Phe Glu Ala Lys Asp Phe Leu 
85 90 95 100 

CAT GTT AAG GAA AAA TTT AAC AAT GAC TGG TGG ATA GGG CGA TTG GTA 570 
His Val Lys Glu Lys Phe Asn Asn Asp Trp Trp lie Gly Arg Leu Val 
105 110 115 

AAA GAA GGC TGT GAA ATC GGA TTC ATT CCA AGC CCA GTC AAA CTA GAA 616 
Lys Glu Gly Cys Glu lie Gly Phe lie Pro Ser Pro Val Lys Leu Glu 
120 125 130 

AAC ATG AGG CTG CAG CAT GAA CAG AGA GCC AAG CAA GGG AAA TTC TAC 666 
Asn Met Arg Leu Gin His Glu Gin Arg Ala Lys Gin Gly Lys Phe Tyr 
135 140 145 

TCC AGT AAA TCA GGA GGA AAT TCA TCA TCC AGT TTG GGT GAC ATA GTA 714 
Ser Ser Lys Ser Gly Gly Asn Ser Ser Ser Ser Leu Gly Asp lie Val 
150 155 160 

CCT AGT TCC AGA AAA TCA ACA CCT CCA TCA TCT GCT ATA GAC ATA GAT 762 
Pro Ser Ser Arg Lys Ser Thr Pro Pro Ser Ser Ala lie Asp lie Asp 
165 170 175 180 

GCT ACT GGC TTA GAT GCA GAA GAA AAT GAT ATT CCA GCA AAC CAC C6C 810 
Ala Thr Gly Leu Asp Ala Glu Glu Asn Asp He Pro Ala Asn His Arg 
185 190 195 

TCC CCT AAA CCC AGT GCA AAC AGT GTA ACQ TCA CCC CAC TCC AAA GAG 858 
Ser Pro Lys Pro Ser Ala Asn Ser Val Thr Ser Pro His Ser Lys Glu 
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200 205 210 

AAA AGA ATG CCC TTC TTT AAG AA6 ACA GAG CAC ACT CCT CCG TAT GAT 906 
Lys Arg Met Pro Phe Phe Lys Lys Thr Glu His Thr Pro Pro Tyr Asp 
215 220 225 

GTG GTA CCT TCC ATG CGA CCA GT6 GTC CTA GTG GGC CCT TCT CTG AAG 954 
Val val Pro Ser Met Arg Pro Val Val Leu Val Gly Pro Ser Leu Lys 
230 235 240 

GGC TAC GAG GTC ACA GAT ATG ATG CAA AAA GCG CTG TTT GAT TTT TTA 1002 
Gly Tyr Glu Val Thr Asp Met Met Gin Lys Ala Leu Phe Asp Phe Leu 

245 250 255 260 

AAA CAC AGA TTT GAA GGG CGG ATA TCC ATC ACA AGG GTC ACC GCT GAC 1050 
Lys His Arg Phe Glu Gly Arg lie Ser He Thr Arg Val Thr Ala Asp 
265 270 275 

ATC TCG CTT GCC AAA CGC TCG GTA TTA AAC AAT CCC AGT AAG CAC GCA 1098 
He Ser Leu Ala Lys Arg Ser Val Leu Asn Asn Pro Ser Lys His Ala 
280 285 290 

ATA ATA GAA AGA TCC AAC ACA AGG TCA AGC TTA GCG GAA 6TT CAG AGT 1146 
He He Glu Arg Ser Asn Thr Arg Ser Ser Leu Ala Glu Val Gin Ser 
295 300 305 

GAA ATC GAA AGG ATT TTT GAA CTT GCA AGA ACA TTG CAG TTG GTG GTC 1194 
Glu He Glu Arg He Phe Glu Leu Ala Arg Thr Leu Gin Leu Val Val 
310 315 320 

CTT GAC GCG GAT ACA ATT AAT CAT CCA GCT CAA CTC AGT AAA ACC TCC 1242 
Leu Asp Ala Asp Thr He Asn His Pro Ala Gin Leu Ser Lys Thr Ser 
325 330 335 340 

TTG GCC CCT ATT ATA GTA TAT GTA AAG ATT TCT TCT CCT AAG GTT TTA 1290 
Leu Ala Pro He He Val Tyr Val Lys He Ser Ser Pro Lys Val Leu 
345 350 355 

CAA AGG TTA ATA AAA TCT CGA GGG AAA TCT CAA GCT AAA CAC CTC AAC 1338 
Gin Arg Leu He Lys Ser Arg Gly Lys Ser Gin Ala Lys His Leu Asn 

360 365 370 

GTC CAG ATG GTA GCA GCT GAT AAA CTG GCT CAG TGT CCT CCA GAG CTG 1386 
Val Gin Met Val Ala Ala Asp Lys Leu Ala Gin Cys Pro Pro Glu Leu 
375 380 385 

TTC GAT GTG ATC TTG GAT GAG AAC CAG CTT GAG GAT GCC TGT GAG CAC 1434 
Phe Asp Val He Leu Asp Glu Asn Gin Leu Glu Asp Ala Cys Glu His 
390 395 400 

CTT GCC GAC TAT CTG GAG GCC TAC TGG AAG GCC ACC CAT CCT CCC AGC 1482 
Leu Ala Asp Tyr Leu Glu Ala Tyr Trp Lys Ala Thr His Pro Pro Ser 
405 410 415 420 

AGT AGC CTC CCC AAC CCT CTC CTT AGC CGT ACA TTA GCC ACT TCA AGT 1530 
Ser Ser Leu Pro Asn Pro Leu Leu Ser Arg Thr Leu Ala Thr Ser Ser 
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425 



430 



435 



CTG CCT CTT AGC CCC ACC CTA GCC TCT AAT TCA CAG GGT TCT CAA GGT 1578 
lieu Pro Leu Ser Pro Thr Leu Ala Ser Asn Ser Gin Gly Ser Gin Gly 
440 445 450 

GAT CAG AGG ACT GAT CGC TCC GCT CCT ATC CGT TCT GCT TCC CAA GCT 1626 
Asp Gin Arg Thr Asp Arg Ser Ala Pro He Arg Ser Ala Ser Gin Ala 
455 460 465 

GAA GAA GAA CCT AGT GTG GAA CCA GTC AA6 AAA TCC CAG CAC CGC TCT 1674 
Glu Glu Glu Pro Ser Val Glu Pro Val Lys Lys Ser Gin His Arg Ser 
470 475 480 

TCC TCC TCA GCC CCA CAC CAC AAC CAT CGC AGT G6G ACA AGT CGC GGC 1722 
Ser Ser Ser Ala Pro His His Asn His Arg Ser Gly Thr Ser Arg Gly 
485 490 495 500 

CTC TCC AGG CAA GAG ACA TTT GAC TCG GAA ACC CAG GAG AGT CGA GAC 1770 
Leu Ser Arg Gin Glu Thr Phe Asp Ser Glu Thr Gin Glu Ser Arg Asp 
505 510 515 

TCT GCC TAC GTA GAG CCA AAG GAA GAT TAT TCC CAT GAC CAC GTG GAC 1818 
Ser Ala Tyr Val Glu Pro Lys Glu Asp Tyr Ser His Asp His Val Asp 
520 525 530 

CAC TAT GCC TCA CAC CGT GAC CAC AAC CAC ASA GAC GAG ACC CAC GGG 1866 
His Tyr Ala Ser His Arg Asp His Asn His Arg Asp Glu Thr His Gly 
535 540 545 

AGC AGT GAC CAC AGA CAC AGG GAG TCC CGG CAC CGT TCC CGG GAC GTG 1914 
Ser Ser Asp His Arg His Arg Glu Ser Arg His Azg Ser Arg Asp Val 
550 555 560 

GAT CGA GAG CAG GAC CAC AAC GAG T6C AAC AAG CAG CGC AGC CGT CAT 1962 
Asp Arg Glu Gin Asp His Asn Glu Cys Asn Lys Gin Arg Ser Arg His 
565 570 575 580 

AAA TCC AAG GAT CGC TAC TGT GAA AAG GAT GGA GAA GTG ATA TCA AAA 2010 
Lys Ser Lys Asp Arg Tyr Cys Glu Lys Asp Gly Glu Val He Ser Lys 
585 590 595 

AAA CGG AAT GAG GCT GGG GAG TGG AAC AGG GAT GTT TAC ATC CCC CAA 2058 
Lys Arg Asn Glu Ala Gly Glu Trp Asn Arg Asp Val Tyr He Pro Gin 
600 605 610 

T6AGTTTT6C CCTTTTGTGT TTTTTTTTTT TTTTTTTTGA AGTCTTGTAT AACTAACA6C 2118 

ATCCCCAAAA CAAAAAGTCT TT6GGGTCTA CACTGCAATC ATATGTGATC TGTCTTGTAA 2178 

TATTTTGTAT TATTGCTGTT GCTTGAATAG CAATAGCATG GATAGAGTAT TGAGATACTT 2238 

TTTCTTTTGT AAGTGCTACA TAMTTGGCC TG6TATGGCT GCAGTCCTCC GGTTGCATAC 2298 

TGGACTCTTC AAAAACTGTT TTGGGTAGCT 6CCACTTGAA CAA7UITCT6T TGCCACCCAG 2358 
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GTGATGTTAG TGTTTTAAGA AATGTAGTTG ATGTATCCTkA CAAGCCAGAA TCAGCACAGA 2418 

TAAAAAGTGG AATTTCTTGT TTCTCCAGAT TTTTAATACG TTAATACGCA GGCATCTGAT 2478 

TTGCATATTC ATTCATGGAC CACTGTTTCT TGCTTGTACC TCTGGCTGAC TAAATTTGGG 2538 

GACAGATTCA GTCTTGCCTT ACACAAAGGG GATCATAAAG TTA6AATCTA TTTTCTATGT 2598 

ACTAOTACTG TGTACTGTAT AGACAGTTTG TAAATGTTAT TTCTGCAAAC AAACACCTCC 2658 

TTATTATATA TAATATATAT ATATATATCA GTTTGATCAC ACTATTTTAG AGTC 2712 
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IS CLAIMED IS: 

1. An isolated DNA fragment, comprising a sequence of 
nucleotides that encodes cOi subunit selected from the group 
consisting of a^.^,, oc^,^, a^.j, a^^-i and a^.j. 

2. The DNA fragment of claim 1, wherein the cr^ sub\init 
is a^^.^ or a^.^^ 

3. The DNA fragment of claim 1, wherein the subunit 
is QfiE.i or cr^B.j. 

4. The DNA fragment of claim 1, wherein the subunit 

5. An isolated DNA fragment, comprising a secjuence of 
nucleotides that encodes a 0 subunit selected from the group 
consisting of p^, /J, and p^. 

6. The DNA fragment of claim 5, wherein the subunit is 
a i^jcf 02D or fi^E subunit. 

7. The DNA fragment of claim 5, wherein the subunit is 
a p^ subunit. 

8. The DNA fragment of claim 7, wherein the subunit is 
a /?3.^ subunit. 

9. The DNA fragment of claim 5, wherein the subunit is 
a subunit. 

10. The DNA fragment of claim 9, wherein the subunit has 
an amino acid sequence set forth in SEQ ID No. 28. 

11. A eukaryotic cell, comprising heterologous DNA that 
encodes an subunit selected from the group of subunits 
consisting of a^^^n ot2,^,2i "iMr and a„.3. 

12. A eukaryotic cell, comprising heterologous DNA that 
encodes an subunit and heterologous DNA that encodes a P 
subimit, wherein at least one subunit is selected from the 
group of subunits consisting of ai^-i/ 0fiA-2# «ic-2' ofiB-w aas-s' p2C' 
^2D' ^3-1/ a P^ svibunit. 

13. The eukaryotic cell of claim 12, wherein the p 
subunit is a P2 subunit. 

14. The eulcaryotic cell of claim 12, wherein the p 
subunit is a p^ subunit. 
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15. The eukaryotic cell of claim 11, selected from the 
group consisting of HEK 293 cells, Chinese hamster ovary 
cells, African green monkey cells, and mouse L cells. 

16. The eukaryotic cell of claim 12 selected from the 
5 group consisting of HEK 293 cells, Chinese hamster ovary 

cells, African green monkey cells, and mouse L cells. 

17. A eukaryotic cell with a functional, heterologous 
calcium channel, produced by a process conprising: 

introducing into the cell heterologous nucleic acid that 
10 encodes an ai-sub\init of a human calcium channel, wherein: 

the Qf^ Bubunit is selected from the group consisting of 

OflA-l/ ^lA-2# 0^lC-2# O^lB-l aiB.3; 

the heterologous calcium channel contains at least one 
subunit encoded by the heterologous nucleic acid; and 
15 the only heterologous ion channels are calcium channels. 

18. A eukaryotic cell with a functional, heterologous 
calcium channel, produced by a process comprising: 

introducing into the cell nucleic acid that encodes an 
subunit of a human calcium channel and introducing into the 
20 cell nucleic acid that encodes a p subunit of a himian calcium 
channel, wherein: 

at least one of the subunits is s elected from the group 
consisting of a^,^, a^.^, a^,.,, of„.3, p^^, p^, /5^, a and a 
subunit ; 

25 the heterologous calcium channel contains at least one 

subunit encoded by the heterologous nucleic acid; and 

the only heterologous ion channels are calcium channels . 

19. The eukaryotic cell of claim 17 selected from the 
group consisting of HEK 293 cells, Chinese hamster ovary 

30 cells, African green monkey cells, mouse L cells and axnphibian 
oocytes . 

20. The eukaryotic cell of claim 18 selected from the 
group consisting of HEK 293 cells, Chinese hamster ovary 
cells, African green monkey cells, mouse L cells and amphibian 

35 o5cytes. 



-280- 



21. The eukaryotic cell of claim 18, wherein the p 
subunit is a /Sa, or siibunit of a human calcium channel. 

22. The eukaryotic cell of claim 18, wherein the calcium 
channel includes an cr^^ subunit of a human calcium channel, an 

5 ^iB>i subunit of a human calcium channel and a jS, subunit of a 
human calcium chemnel. 

23. The eukaryotic cell of claim 18^ wherein the calcium 
channel Includes an a^.i, ajb* and a jS^.j subunit, or an ofxa-i, 
Ofjb, and a /S^.j siibunit, or an ofiB.2' ^ih' and a ^^.3 subunit, or 

10 an axA-2/ <^3b* and a fi^.^ subunit, or a ot^^^^, 03^, and an ^3.^ 
subunit . 

24. The eukaryotic cell of claim 18, wherein the 
calcium channel contains an siibunit of a human calcium 
channel, an or an subimit of a human calcium channel cmd 

15 a ^^.1, /3^.2 or /S^., subunit of a human calcium channel , 

25. A method for identifying a compound that modulates 
the activity of a calcium channel, cottprising; 

suspending a eukaryotic cell that has a fiinctional, 
heterologous calcium channel, in a solution containing the 
20 compound and a calcium channel -selective ion: 

depolarizing the cell membrane of the cell; and 
detecting the current flowing into the cell, 
wherein : 

the heterologous calcium channel includes at least one 
25 human calcium channel subunit encoded by DNA or RNA that is 

heterologous to the cell; 

at least one subunit is selected from the group 

consisting of ofi^-i. aiA-2# aiB-i# «ie-3# «ic-2# ftc/ Pzoi Ae* a ft 

subunit and a ft subunit; 
30 the ctirrent that is detected is different from that 

produced by depolarizing the same or a s\ibstantially identical 

cell in the presence of the same calcium channel selective ion 

but in the absence of the compound. 

26. The method of claim 25, wherein the heterologous DNA 
35 or RNA encodes a jS^ subunit. 
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27. The method of claim 26, wherein the heterologous DNA 
or RNA encodes a sxibunit. 

28. A sxibunit- specific antibody selected from the group 
J consisting of coitibodies that bind to an a subunit type or or 

5 subunit subtype of a hiiman calcium channels, wherein the 
subunit is an sxibunit, 

29. The antibody of claim 28, wherein antibody is 
subtype specific and the subunit is a^x, ofig and a^^. 

30. An RNA or single -stranded DNA probe of at least 16 
10 bases in length comprising at least 16 substantially 

contiguous bases from nucleic acids that encode a subunit of 
a human calcium channel selected from the group of subunits 
consisting of ct^^^^, ot^.^, 0fjK-i# Ofiw Pz-i* fiia Pid' 

15 31. The probe of claim 30 that contains at least 30 

bases that are from nucleic acids that encode a subunit of a 
human calcium channel selected from the group of sxibunits 
consisting of ajA-n ot^*2i aiB-i# «ic-2# cTieo' i^3-i# ^20^ ^20 # 02e and 
ff^ subunits. 

20 32. A method for identifying nucleic acids that encode 

a human calcium channel subunit, comprising hybridizing under 
conditions of at least low stringency a probe of claim 30 to 
a library of nucleic acid fragments, and selecting hybridizing 
fragments . ^ 

25 33. A method for identifying cells or tissues that 

express a calcium channel subunit -encoding nucleic acid, 
comprising hybridizing under conditions of at least low 
stringency a probe of claim 30 with mRNA expressed in the 
cells or tissues or cDNA produced from the mRNA, and thereby 

30 identifying cells or tissue that express mRNA that encodes the 
stibunit . 

34. A substcmtially pure human calcium channel subunit 
selected from the group consisting of aix-i*. 0^1x^2' <^iE-if ^ic-2» 

3r ft-lr PlDi P2E ^nd fi^ . 
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